Posts tagged 'huggingface'


huggingface 1

Text Generation Inference (TGI) Review

TGI Document

The TGI introduction page opens by saying that it implements a number of optimizations and features. In this post I review and summarize the following items from that list: Tensor Parallelism for faster inference on multiple GPUs; token streaming using Server-Sent Events (SSE); continuous batching of incoming requests for increased total throughput; optimized transformers code for inference using Flash Attention and Paged Attention on the most popular architectures; Quantizat..

Tech 2025.01.29

Thinking, Writing, and.

Posts about software development, organizational culture, reviews of LLM papers, and assorted other topics.


