Software Engineer (Quantization Engineer)
About this role
About Algorithm Team - Model Compression Part
LLM Quantizationμ΄ μΆλ‘ ν¨μ¨μ±μ κ·Ήλνν μ μλ€λ μ μ λ리 μλ €μ Έ μμ΅λλ€. κ·Έλ¬λ μ΄λ₯Ό μ€μ μλΉμ€μ μ μ©νλ κ²μ μ¬μ ν μ΄λ €μ΄ κ³Όμ μ λλ€. Model Compression Partλ μ¬μ©μ μΉνμ μΈ Model Compression λꡬλ₯Ό κ°λ°ν΄ μ΄λ¬ν μ΄λ €μμ ν΄κ²°νκ³ , κ³ κ°μ΄ μμ¬ NPUλ₯Ό μ΅κ³ μ ν¨μ¨λ‘ νμ©ν μ μλλ‘ μ§μνλ κ²μ λͺ©νλ‘ ν©λλ€.
Model Compression λκ΅¬κ° Hardware-specific μ΅μ νλ₯Ό ν¬ν¨ν λ, ν¨μ¨μ±μ κ·Ήλνν μ μμ΅λλ€. μ°λ¦¬λ μ΄λ¬ν μꡬλ₯Ό μΆ©μ‘±νκΈ° μν΄ μμ¬ NPUμ νΉνλ μ΅μ ν κΈ°λ₯μ κ°μΆ μ체 λꡬλ₯Ό κ°λ°νμμΌλ©°, μ΄λ₯Ό ν΅ν΄ NPUμ μ±λ₯μ μ΅λλ‘ λμ΄μ¬λ¦΄ μ μλ νμ μννΈμ¨μ΄ μ€νμ μ 곡ν©λλ€.
FuriosaAI Model Compression λꡬλ μλν, νμ₯μ±, μμ μ±μ μ§μμ μΌλ‘ κ°μ νλ©΄μ μ μ λ λ§μ κΈ°λ₯μ΄ μꡬλ©λλ€. μ΄μ λ°λΌ μννΈμ¨μ΄ μμ§λμ΄λ§ μλμ΄ λ§€μ° μ€μν μν©μ λλ€. λ°λΌμ νλΆν μννΈμ¨μ΄ μμ§λμ΄λ§ κ²½νμ 보μ νκ³ μμΌλ©°, Model Compression μμ§λμ΄λ‘μ 컀리μ΄λ₯Ό λ°μ μν€κ³ μ νλ μΈμ¬λ₯Ό μ°Ύκ³ μμ΅λλ€.
Responsibilities
Model Compression λꡬ κ°λ°
λ€μν μμνλ λͺ¨λΈ ν보 λ° μ±λ₯ κ²μ¦
μ΄λ₯Ό κΈ°λ°μΌλ‘ λ μ§λ³΄λ Compression Algorithmκ°λ°
Minimum Qualifications
PyTorch κ°λ° κ²½νμ΄ νλΆνμ λΆ
μμ© μννΈμ¨μ΄ κ°λ° κ²½νμ΄ μμΌμ λΆ
κ΄λ ¨ λΆμΌμμ 3λ μ΄μμ μ€λ¬΄ κ²½λ ₯μ 보μ νμ λΆ
Preferred Qualifications
DevOps λ° MLOpsμ λν κ²½νκ³Ό μ§μ
vLLM, TensorRT-LLM λ±μ LLM inference toolμ μ¬μ©ν κ²½ν
Deep Learning Quantization κ²½νκ³Ό μ§μ
Deep Learning κ°μκ³Ό κ΄λ ¨λ νμ¬μμμ 근무 κ²½ν
Contact
Frequently Asked Questions
Is the salary disclosed for the Software Engineer (Quantization Engineer) position at furiosa-ai?
Is the Software Engineer (Quantization Engineer) job at furiosa-ai remote?
Is the Software Engineer (Quantization Engineer) role at furiosa-ai full-time or part-time?
Which team or department does the Software Engineer (Quantization Engineer) at furiosa-ai belong to?
How do I apply for the Software Engineer (Quantization Engineer) position at furiosa-ai?
When was the Software Engineer (Quantization Engineer) job at furiosa-ai posted?
You'll be redirected to furiosa-ai's official application page on Ashby ATS.