Please note that the expected salary is an estimate. Salary will be negotiated after the final round of interviews.
1, 2+ years of development experience; familiar with LLMs, Python, and Flask
2, Ph.D. in AI from Macau University of Science and Technology
3, Strong logical thinking and application design skills; can continuously optimize algorithms for specific business scenarios to improve their accuracy.
1. Proficient in programming languages such as C++ and Python, image processing tools like OpenCV, VTK, PCL, and Halcon, and interface development using Qt, Visual Studio, CMake, etc.
2. Familiar with classic machine learning algorithms like LR, deep learning frameworks such as PyTorch and TensorFlow, object detection algorithms like YOLO, image segmentation algorithms like UNet, tracking algorithms like SORT, and commonly used NLP models and techniques including text classification, named entity recognition, sentiment analysis, etc.
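Deep learning and large language models (LLMs):
- In-depth understanding of the data required for large-model training and of large-model training principles; proficient in data annotation requirements and workflows for large language models/NLP, especially in the financial industry
- Proficient in ChatGPT prompting and instruction fine-tuning; able to quickly build applications and deploy private enterprise models
- Familiar with deep learning frameworks such as TensorFlow and Hugging Face, and with the principles of Transformer/ChatGPT 3.0 large models and algorithms
- Understands the principles of diffusion model algorithms; has experience with OpenCV; understands data annotation operations and their importance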
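1 Has experience deploying algorithms in production, as well as NLP-related work experience
2 Work at Baidu has covered multiple areas, including computer vision and NLP
3 In NLP, collaborated with Baidu's knowledge graph department on a named entity recognition project, and recently worked with the ERNIE Bot (文心一言) team on large-model training and compression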
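Familiar with prompt tuning for Stable Diffusion text-to-image and image-to-image AIGC techniques
Familiar with KNN classification; familiar with large-scale model algorithms
Familiar with using TF-IDF models for text similarity
Familiar with building machine learning models for text analysis based on term frequency, bag-of-words, part-of-speech, and short phrases
Familiar with data mining skills, e.g., binarization, log transformation, min-max, Z-score, L1/L2 regularization, Logistic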
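8 years of algorithm experience; took part in the production engineering of multiple algorithm projects, covering finance, government affairs, operations and maintenance, and other domains
Proficient in the ML/DL/NLP/LLM knowledge stack; for LLMs: models (LLaMA/ChatGLM), pre-training, fine-tuning, deployment, etc.
Strong sense of responsibility, forward-looking vision, and strong ability to research frontier developments
Extensive project experience in LLMs and AIGC; currently leads a large team at Baidu working on large-model pre-training
Proficient in the following technologies: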
Research Interests: BioComputing (AI4S), AIGC (ChatGPT, diffusion model), Multi-Modal pretraining, First-Principle driven
Programming Languages: Python, Java, C++/C, C#, MATLAB, SQL, R, HTML5, CSS3, JavaScript