Hadoop Administrator 1 (Data Services)
San Antonio, TX 
Posted 27 days ago
Job Description
Date: Apr 5, 2024

Location: San Antonio, TX, US, 78205

Company: CPS Energy

We are engineers, high line workers, power plant managers, accountants, electricians, project coordinators, risk analysts, customer service operators, community representatives, safety and security specialists, communicators, human resources partners, information technology technicians and much, much more. We are 3,300 people committed to enhancing the lives of the communities we serve. Together, we power the growth, success, and progress of our community every day!

Position Summary

The Hadoop administrator is responsible for the care, maintenance, administration, and reliability of the Hadoop ecosystem. The role includes ensuring system security, stability, reliability, capacity planning, recoverability (protecting business data), and performance, in addition to delivering new system and data management solutions to meet the growing and evolving data demands of the enterprise. The Hadoop administrator administers Cloudera technology and systems and is responsible for backup, recovery, architecture, performance tuning, security, auditing, metadata management, optimization, statistics, capacity planning, connectivity, and other data solutions of Hadoop systems.

GRADE: 14*

*Qualifications may warrant placement in a different job level

DEADLINE TO APPLY: Open Until Filled

Tasks and Responsibilities
  • Provides support and maintenance for Hadoop and its ecosystem, including HDFS, YARN, Hive, LLAP, Druid, Impala, Spark, Kafka, HBase, Cloudera Work Bench, etc.
  • Accountable for storage, performance tuning, and volume management of Hadoop clusters and MapReduce routines.
  • Deploys Hadoop clusters, adds and removes nodes, tracks jobs, monitors critical parts of the cluster, configures name-node high availability, and schedules and takes backups.
  • Installs and configures software, installs patches, and upgrades software as needed.
  • Performs capacity planning and implements new/upgraded hardware and software releases for storage infrastructure.
  • Handles cluster design, capacity planning, cluster setup, performance tuning, monitoring, structure planning, scaling, and administration.
  • Communicates with other development, administration, and business teams, including infrastructure, application, network, database, and business intelligence teams.
  • Responsible for Data Lake and Data Warehousing design and development.
  • Collaborates with various technical/non-technical resources such as infrastructure and application teams regarding project work, POCs (Proofs of Concept) and/or troubleshooting exercises.
  • Configures and implements Hadoop security, specifically Kerberos integration.
  • Creates and maintains job and task schedules and administers jobs.
  • Responsible for data movement in and out of Hadoop clusters and data ingestion using Sqoop and/or Flume.
  • Reviews Hadoop environments and determines compliance with industry best practices and regulatory requirements.
  • Performs data modeling, design, and implementation based on recognized standards.
  • Serves as a key contact for vendor escalation.
  • Participates in an on-call rotation to support a 24/7 environment and is expected to be able to work outside business hours to support corporate needs.
  • Performs other duties as assigned.
Minimum Skills
Minimum Knowledge and Abilities
Intermediate experience in a Hadoop production environment.
Must have intermediate experience and expert knowledge with at least 4 of the following:
Hands-on experience with Hadoop administration in Linux and virtual environments.
Well versed in installing and managing distributions of Hadoop (Cloudera).
Expert knowledge and hands-on experience in Hadoop ecosystem components, including HDFS, YARN, Hive, LLAP, Druid, Impala, Spark, Kafka, HBase, Cloudera Work Bench, etc.
Thorough knowledge of Hadoop overall architecture.
Experience using and troubleshooting Open Source technologies including configuration management and deployment.
Data Lake and Data Warehousing design and development.
Experience reviewing existing DB and Hadoop infrastructure and determining areas of improvement.
Implementing software lifecycle methodology to ensure supported release and roadmap adherence.
Configuring high availability of name-nodes.
Scheduling and taking backups for Hadoop ecosystem.
Data movement in and out of Hadoop clusters.
Good hands-on scripting experience in a Linux environment.
Experience in project management concepts, tools (MS Project) and techniques.
A record of working effectively with application and infrastructure teams.
Strong ability to organize information, manage tasks and use available tools to effectively contribute to a team and the organization.
Valid Class C Texas Driver's License.
Makes independent recommendations.
Preferred Qualifications
  • Master's degree in Information Systems or related field from an accredited university.
  • Cloudera, Big Data and Hadoop certifications are a plus.
  • Utility Industry experience.
  • NERC (North American Electric Reliability Corporation) background and experience.
Competencies
Demonstrating Initiative
Displaying Technical Expertise
Serving Customers
Learning Quickly
Using Computers and Technology
Minimum Education
Bachelor's degree in Information Systems, Engineering, Computer Science, or related field from an accredited university.
Required Certifications
Working Environment
Indoor work, operating computer, manual dexterity, talking, hearing, repetitive motion. Use of personal computing equipment, telephone, multi-functioning printer and calculator.
Ability to travel to and from meetings, training sessions or other business-related events. After-hours work may be required. Overnight travel may be required.
Physical Demands
Exerting up to 10 pounds of force occasionally, and/or a negligible amount of force frequently or constantly to lift, carry, push, pull or otherwise move objects, including the human body.
Sedentary work involves sitting most of the time. Jobs are sedentary if walking and standing are required only occasionally, and all other sedentary criteria are met.

CPS Energy does not discriminate against applicants or employees. CPS Energy is committed to providing equal opportunity in all of its employment practices, including selection, hiring, promotion, transfers and compensation, to all qualified applicants and employees without regard to race, religion, color, sex, sexual orientation, gender identity, national origin, citizenship status, veteran status, pregnancy, age, disability, genetic information or any other protected status. CPS Energy will comply with all laws and regulations.


Nearest Major Market: San Antonio

Job Segment: Power Plant, Data Warehouse, System Administrator, Computer Science, Systems Engineer, Energy, Technology, Engineering

 

Job Summary

Company: CPS Energy

Start Date: As soon as possible

Employment Term and Type: Regular, Full Time

Required Education: Bachelor's Degree

Required Experience: Open