Algorithm Engineer - Large Model

Full Time, onsite
Alpha X Technology PTE. LTD.
Central, Singapore

$ 5,500 - $ 8,000 / month

Alpha X is an innovative high-tech manufacturing technology company pioneering the integration of advanced automation, transportation and AI-driven solutions to revolutionize traditional manufacturing processes. We harness the power of artificial intelligence, machine learning, and robotics to optimize production efficiency, enhance product quality, and foster sustainability across diverse high-tech industries.

Job Responsibilities:

Participate in the continuous optimization, design, and implementation of the company's decision-making large model product, including training data construction, data matching, model training, metric evaluation, and related work.
Train and research large models, including but not limited to pre-training, supervised fine-tuning (SFT), alignment, and multimodal techniques, and explore applications of large models in the semiconductor field.
Build a proprietary large model for the semiconductor field, integrate semiconductor domain knowledge, and rapidly deploy it in semiconductor business scenarios.
Research and understand large model evaluation systems and data construction methods, and evaluate the full range of large model capabilities, including understanding, reasoning, agents, and RAG.
Lead the overall design and development of the Large Language Model (LLM) inference engine; optimize the engine, algorithms, and model architecture to improve computing performance.
Explore cutting-edge natural language understanding technologies, including model distillation, edge computing, multi-turn semantic understanding, and knowledge fusion.
Follow the latest progress in large models, keep up with cutting-edge industry trends, and continuously iterate on large model-related algorithm modules to achieve the product vision and goals.

Job Requirements:

Master's degree or above in computer science, artificial intelligence, or a related field; doctoral degree preferred. Enthusiasm for and confidence in the future development of large model technology.
Proficient in Python and in one or more deep learning frameworks (such as TensorFlow, PyTorch, or PaddlePaddle).
Solid grasp of the basic theories and algorithms of Natural Language Processing (NLP), relevant industry experience in NLP, and proficiency in the architecture and principles of the Transformer family of models.
Command of natural language processing tasks such as question-answering systems, retrieval systems, recommendation systems, knowledge graphs, reasoning graphs, sequence labeling, and abstractive and extractive summarization.
Deep understanding of mainstream pre-trained large language models (GPT/LLaMA/GLM/BLOOM/Qwen/BERT), with strong exploratory and practical skills.
Command of large model technologies, with some understanding of and hands-on experience with mainstream large model application frameworks and approaches such as RAG, AI agents, and LangChain.
Command of training data construction, data ratios, training, and metric evaluation for large model pre-training (PT), continued pre-training, SFT, and RLHF in vertical industry domains.
Familiar with at least one distributed training framework for large models, such as DeepSpeed, Megatron, or Colossal-AI.
Familiar with LLM inference engines and their mainstream optimization methods, such as Triton/FasterTransformer/vLLM and FlashAttention/PagedAttention.
Excellent English reading and writing skills, fluent listening and speaking skills.
Candidates with an overseas study background are preferred.

You’ll only be the right candidate if you are aligned with our values and culture:

Collaborative entrepreneurial spirit
Winning through customers
High ethical standards, openness and trust
Expectations for results
Respect and value people
