
Data Engineer
Key Responsibilities
The Data Engineer is to be responsible for managing the dataset lifecycle while being the key point of contact for dataset-related questions and documenting the entire data management policy process from data pre-acquisition to data deconstruction. The Data Engineer is to work with other agencies to collect verified data, understand the datasets fully, maintain the datasets and storage, ensuring proper access and use of the datasets, and rectify any issues and errors faced with the datasets.
Requirements:
- Manage and organize large datasets, including data acquisition, cleaning, transformation, and storage.
- Develop and implement data governance policies and procedures to ensure the accuracy, completeness, and security of our data.
- Collaborate with cross-functional teams to design and implement data solutions that meet business requirements.
- Develop and maintain data dictionaries, metadata, and other documentation to support data management and analysis.
- Develop and maintain data quality metrics and processes to monitor and improve data quality.
- Provide technical leadership and guidance to the team, including mentoring junior data analysts.
- Stay up-to-date with emerging trends and technologies in data management and analysis.
- Strong understanding of data governance and data quality principles.
- Experience with data visualization tools such as Tableau or Power BI.
- Excellent problem-solving and troubleshooting skills.
- Strong communication and collaboration skills.
Preferred Qualifications:
- Degree/ qualifications in Computer Science, Information Management, or a related field.
- Experience with data management tools such as SQL, Python, or R.
- Professional certifications in data management, such as Certified Data Management Professional (CDMP) or Data Management Association Certified Data Management Professional (DAMA-CDMP).
- Experience with big data technologies such as Hadoop or Spark.
- Experience with cloud-based data management platforms such as AWS or Azure.
- Experience with machine learning and predictive analytics.
Key Responsibilities
The Data Engineer is to be responsible for managing the dataset lifecycle while being the key point of contact for dataset-related questions and documenting the entire data management policy process from data pre-acquisition to data deconstruction. The Data Engineer is to work with other agencies to collect verified data, understand the datasets fully, maintain the datasets and storage, ensuring proper access and use of the datasets, and rectify any issues and errors faced with the datasets.
Requirements:
- Manage and organize large datasets, including data acquisition, cleaning, transformation, and storage.
- Develop and implement data governance policies and procedures to ensure the accuracy, completeness, and security of our data.
- Collaborate with cross-functional teams to design and implement data solutions that meet business requirements.
- Develop and maintain data dictionaries, metadata, and other documentation to support data management and analysis.
- Develop and maintain data quality metrics and processes to monitor and improve data quality.
- Provide technical leadership and guidance to the team, including mentoring junior data analysts.
- Stay up-to-date with emerging trends and technologies in data management and analysis.
- Strong understanding of data governance and data quality principles.
- Experience with data visualization tools such as Tableau or Power BI.
- Excellent problem-solving and troubleshooting skills.
- Strong communication and collaboration skills.
Preferred Qualifications:
- Degree/ qualifications in Computer Science, Information Management, or a related field.
- Experience with data management tools such as SQL, Python, or R.
- Professional certifications in data management, such as Certified Data Management Professional (CDMP) or Data Management Association Certified Data Management Professional (DAMA-CDMP).
- Experience with big data technologies such as Hadoop or Spark.
- Experience with cloud-based data management platforms such as AWS or Azure.
- Experience with machine learning and predictive analytics.