Senior (5+ years of experience)
Summary of the Role
As a Senior Big Data Engineer, you'll lead the design and development of large-scale data processing systems and provide technical leadership within the data team. You're expected to have an extensive background in software engineering, data warehousing, and big data processing technologies. Your role involves optimizing data pipelines, building the infrastructure for optimal extraction, transformation, and loading of data from various sources, and ensuring that the architecture supports the company's needs both today and in the future.
Required Skills
Expertise in programming languages such as Java, Scala, or Python.
In-depth knowledge of big data processing technologies like Apache Spark and Hadoop ecosystem.
Strong experience with NoSQL databases, such as Cassandra or MongoDB.
Proficiency in building and optimizing 'big data' data pipelines, architectures, and data sets.
Experience with stream-processing systems, such as Apache Storm or Samza.
Ability to build processes that support data transformation, data structures, metadata, dependency, and workload management.
Good understanding of distributed system design and architecture.
Strong analytical skills with the ability to collect, organize, analyze, and disseminate significant amounts of information with attention to detail and accuracy.
Solid understanding of data security and data privacy practices.
Excellent communication and leadership skills.
Qualifications
Bachelor's or Master's degree in Computer Science, Engineering, or related field.
Minimum of 5 years' experience in a Big Data engineering role.
Proven experience with big data tools such as Hadoop, Spark, Kafka, etc.
Experience with data modeling, data warehousing, and building ETL pipelines.
Strong understanding of SQL and experience with RDBMS.
Experience with cloud services such as AWS, Azure, or Google Cloud Platform.
Familiarity with machine learning algorithms and data science techniques.
Proven ability to work with varied forms of data infrastructure, including relational databases, big data frameworks, and cloud platforms.
Experience implementing data governance and privacy policies.
Strong problem-solving skills and attention to detail.
Responsibilities
Design and implement large-scale data processing systems.
Develop and manage data warehouses and real-time processing solutions.
Lead the architectural decisions and implementation of big data technologies.
Work with cross-functional teams to identify and capture data needs of the organization.
Ensure the performance, quality, and responsiveness of data systems.
Mentor junior data engineers and review their work.
Stay current with industry trends and introduce new technologies that can enhance data capabilities.
Handle the extract, transform, load (ETL) process including data quality and consistency.
Develop data APIs for data consumption by various internal and external stakeholders.
Build analytics tools that provide actionable insights into key business metrics.