Skip to content

Commit 03a23de

Browse files
committed
edit readme of neuron
1 parent bb1bd5b commit 03a23de

File tree

1 file changed

+16
-4
lines changed

1 file changed

+16
-4
lines changed

neuron/Readme.md

Lines changed: 16 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -2,17 +2,28 @@
22

33
AWS Neuron (Tranium, Tranium1, Inferentia, Inferentia2 ) 에 관련 링크, 튜토리얼, 가이드를 제공 합니다.
44

5-
Last updated: Feb 25, 2024
5+
Last updated: Mar 31, 2024
66

77
---
88

99

1010
# 1. Quick Links
11-
- AWS Neuron 공식 문서[AWS Neuron Documentation](https://awsdocs-neuron.readthedocs-hosted.com/en/latest/)
11+
## AWS Neuron
12+
- AWS Neuron 공식 문서: [AWS Neuron Documentation](https://awsdocs-neuron.readthedocs-hosted.com/en/latest/)
1213
- AWS Neuron 공식 Git Repo: [aws-neuron-samples](https://github.com/aws-neuron/aws-neuron-samples)
13-
- AWS Neuron Roadmap 으로서 완료, 진행 중인 기능 및 모델 확인: [AWS Neuron Roadmap](https://github.com/orgs/aws-neuron/projects/1)
14+
- Trainium 에서 지원 하는 모델 확인: [Training Samples/Tutorials](https://awsdocs-neuron.readthedocs-hosted.com/en/latest/general/models/training-trn1-samples.html#model-samples-training-trn1)
15+
- Inferentia2/Trainium 에서 지원 하는 모델 확인: [Inference Samples/Tutorials (Inf2/Trn1)
16+
](https://awsdocs-neuron.readthedocs-hosted.com/en/latest/general/models/inference-inf2-trn1-samples.html#model-samples-inference-inf2-trn1)
17+
- Inferentia 에서 지원 하는 모델 확인: [Inference Samples/Tutorials (Inf1)
18+
](https://awsdocs-neuron.readthedocs-hosted.com/en/latest/general/models/inference-inf1-samples.html#model-samples-inference-inf1)
19+
20+
## Hugging Face Optimum Neuron
1421
- Hugging Face 로 쉽게 AWS Neuron 활용: [Hugging Face Optimum Neuron](https://huggingface.co/docs/optimum-neuron/index)
22+
- Hugging Face Optimum Neuron 지원 아키텍처: [지원 아키텍처](https://huggingface.co/docs/optimum-neuron/package_reference/supported_models)
1523
- Hugging Face Optimum Neuron Git Repo: [Optimum-neuron git](https://github.com/huggingface/optimum-neuron.git)
24+
25+
## vLLM
26+
- [Installation with Neuron](https://docs.vllm.ai/en/latest/getting_started/neuron-installation.html)
1627

1728
<p>
1829

@@ -26,7 +37,8 @@ Last updated: Feb 25, 2024
2637
- (Feb 2024) [AWS Inferentia 기반 위에 llama-2-13B 이용하여 챗봇 데모](hf-optimum/01-Chatbot-Llama-2-13B-Inf2/README.md)
2738
- (Feb 2024) [AWS Tranium 기반 위에 llama-2-7B 및 Dolly Dataset 으로 파인 튜닝](hf-optimum/02-Fine-tune-Llama-7B-Trn1/README.md)
2839

29-
40+
## 2.3. vLLM on Inferentia/Trainium
41+
- (Mar 2024) [vLLM 으로 Inferentia2 (inf2.48xlarge)에서 배치성 추론 하기](vLLM/01-offline_inference_neuron.ipynb)
3042

3143
# 3. 관련 블로그
3244
- [주요 블로그 보기](blog/Readme.md)

0 commit comments

Comments
 (0)