
LiXiaofang
token
Compétences

Voir mes services

Expérience professionnelle
AI Computing & Token Operation Specialist
NVIDIA • Temps plein
Jun 2024 - Present • 2 yrs
Managed GPU cluster resource allocation and bulk Token production system for mainstream LLMs including Llama, Qwen, GPT series. Optimized computing cost and token consumption rules, helped over 120 global clients cut their AI running expense by 35%~45%. Responsible for API docking, private LLM deployment and customized token quota solution design.
AI Technical Consultant
Microsoft • Temps plein
Apr 2023 - Apr 2024 • 1 yr
Provided one-on-one consultation for global AI startups & individual developers, including GPU model selection, computing budget calculation, token pricing planning and LLM interface access guidance. Completed more than 40 lightweight AI resource architecture optimization projects.