Senior Data Quality Engineer Remote
Senior Data Quality Engineer Description
We are looking for an experienced Senior Data Quality Engineer to join our remote team and lead a project focused on ensuring the accuracy and completeness of data in our organization.
In this role, you will be responsible for designing, developing, and executing automated tests to validate data quality, as well as identifying and resolving data quality issues. You will work closely with cross-functional teams to ensure that data meets the organization's requirements and standards.
If you have a passion for data quality and have a proven track record of designing and implementing automated tests, we want to hear from you.
#LI-DNI#LI-AP13
Technologies
- Programming Languages: Python
- SQL - MSSQL, PostgreSQL, MySQL, Oracle
- Big Data - Hadoop, HDFS, Hive, Spark, Kafka, Flume, Sqoop
- NoSQL – Cassandra, HBase, MongoDB
- Data Visualization: Tableau, Tibco Spotfire, Power BI
- ETL - MS SSIS, Talend, MicroStrategy
- Cloud - AWS/Azure/GCP - Storage; Compute; Networking; Identity and Security; Notebooks; Data Catalogs
- MDM tools
- Data Generation
- Test Management Tools: TestRail, Zephyr
- Performance Testing: JMeter
- CI/CD principles & tools: Jenkins
- Queues and Stream processing
- Version Control Systems: Git, SVN
Responsibilities
- Be responsible for data product testing and quality of deliverables on the project
- Data quality implementation and improving data quality as the main goal
- Establish data testing approach and test design, data quality checks across the data lifecycle
- Work on priorities to achieve better data quality on the project within a tight schedule
- Design and implement data testing strategy concerning system architecture and data flows
- Plan and estimate required resources to conduct data product/flow testing and validation concerning company/industry / governmental standards
- Design and develop a data quality framework according to the identified data product strategy (define critical data, controls, and quality targets, set thresholds and alerts, etc.)
- Understand, design, and implement automated data quality checks
Requirements
- 3+ years of software engineering experience and practice in Data Management, Data Quality verification/Data Governance, Data Visualization testing, Data Integration
- Familiar with concepts of data ingestion pipelines, and data storage like OLTP databases, Data Warehousing, Data Lakes
- Understanding of the ETL process and test strategy for ETL
- Strong SQL experience
- Experience with scripting/automation
- Knowledge and practical experience in data validation
- Familiar with test documentation concepts, such as test strategies, test plans, etc
- Familiar with testing methods - TDD, BDT, DDT
- Analytical approach to problem-solving; excellent interpersonal and communication skills
- Experience in Data analysis & requirements validation
- Data-oriented personality; motivated, independent, efficient, and able to work under pressure with a solid sense for setting priorities
- Experience in direct customer communications
- Skills in infrastructure troubleshooting, support in performance tuning and optimization, bottleneck problem analysis
- Understanding of CI/CD principles
Nice to have
- Knowledge of Java , Scala, Bash, xpath
- Experience with big data technologies such as Hadoop, Spark, or Kafka
- Familiarity with cloud platforms such as AWS, GCP, or Azure
- Certifications in data quality or software engineering