- Design, develop, and maintain scalable data pipelines for real-time and batch processing.
- Implement efficient data extraction, transformation, and loading (ETL) processes.
- Optimize data workflows for performance, scalability, and reliability.
- Work with graph databases such as Neo4j and graph analysis libraries such as NetworkX to model and analyze complex relationships.
- Develop and optimize graph-based data structures for efficient querying and analysis.
- Implement algorithms for network analysis, anomaly detection, and pattern recognition.
- Design and implement graph database solutions tailored for data engineering use cases.
- Develop data models that support efficient data storage, retrieval, and analysis within graph databases.
- Ensure data integrity, consistency, and security through best practices in graph database management.
- Work with big data frameworks such as Hadoop and Spark for large-scale data processing.
- Develop and optimize data lake architectures to support efficient data storage and retrieval.
- Utilize streaming frameworks such as Apache Kafka and Apache Flink for real-time data processing.
- Work closely with data scientists, analysts, and software engineers to translate business requirements into technical solutions.
- Collaborate on the integration of data pipelines with machine learning models and analytical applications.
- Provide technical mentorship and guidance to junior team members.
- Implement data validation, monitoring, and governance strategies to ensure high data quality.
- Establish best practices for data lineage, documentation, and version control.
- Ensure compliance with data privacy and security policies.
- 5+ years of professional experience in data engineering or related fields.
- Proven experience in designing and maintaining large-scale data pipelines and processing frameworks.
- Experience working with graph databases such as Neo4j and tools like NetworkX is required.
- Experience in cybersecurity, finance, healthcare, or other high-stakes domains is a plus.
- Proficiency in Python, with strong expertise in data manipulation and graph libraries (Pandas, NumPy, PyTorch Geometric, NetworkX, etc.).
- Strong understanding of graph databases, network analysis, and related algorithms.
- Expertise in the Cypher query language for graph database operations.
- Experience with big data frameworks such as Hadoop and Spark.
- Knowledge of data lake architectures for scalable data storage and retrieval.
- Experience with streaming frameworks like Apache Kafka or Apache Flink for real-time data processing.
- Strong understanding of data structures, algorithms, and distributed computing concepts.
- Experience with API development and integration.
- Familiarity with version control (Git) and automated testing frameworks.
- Experience in designing and deploying graph-based data solutions.
- Certifications in data engineering, big data technologies, or database management.
- Knowledge of cybersecurity data analysis and anomaly detection techniques.