
Official Llama 3 META page : r/LocalLLaMA - Reddit
Apr 18, 2024 · Llama 3 models take data and scale to new heights. It's been trained on our two recently announced custom-built 24K GPU clusters on over 15T tokens of data – a training …
Llama 3: Dense Evolution or Expert Revolution? - Reddit
Sep 16, 2023 · 63 votes, 34 comments. What are your predictions about Llama 3? Will it be another dense model (with maybe 300 billion parameters and 6 trillion…
[D] How to Deploy LLaMA 3 Into Production, and Hardware ...
Apr 23, 2024 · Deploying LLaMA 3 8B is fairly easy but LLaMA 3 70B is another beast. Given the amount of VRAM needed you might want to provision more than one GPU and use a …
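The sizing question above can be sanity-checked with simple arithmetic. A minimal sketch, where the `est_vram_gb` helper and the 20% overhead factor for KV cache and activations are illustrative assumptions, not figures from the thread:

```python
import math

def est_vram_gb(n_params: float, bytes_per_param: float, overhead: float = 1.2) -> float:
    """Rough VRAM estimate: weight memory plus ~20% for KV cache/activations."""
    return n_params * bytes_per_param * overhead / 1e9

def gpus_needed(n_params: float, bytes_per_param: float, gpu_gb: float = 80.0) -> int:
    """How many 80 GB GPUs (e.g. A100/H100) the estimate implies."""
    return math.ceil(est_vram_gb(n_params, bytes_per_param) / gpu_gb)

# 70B at fp16 (2 bytes/param): ~168 GB -> needs multiple 80 GB GPUs
print(gpus_needed(70e9, 2.0))  # -> 3
# 8B at fp16: ~19 GB -> a single data-center GPU is plenty
print(gpus_needed(8e9, 2.0))   # -> 1
```

This is why the 70B model "is another beast": even before batching, fp16 weights alone exceed any single GPU, forcing tensor or pipeline parallelism across devices.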
Quantizing Llama 3 8B seems more harmful compared to other …
For Llama 3 8B, quantizing to Q6_K brings it down to the quality of a 13B model (like Vicuna): still better than other 7B/8B models, but not as good as Q8_0 or fp16, specifically in instruction following.
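The quality/size trade-off described here tracks bits per weight. As a rough sketch, assuming the commonly cited bits-per-weight figures for llama.cpp's GGUF quant types and ignoring the embedding/output layers that are often kept at higher precision:

```python
# Approximate bits-per-weight for common llama.cpp GGUF quant types (assumed values)
BPW = {"fp16": 16.0, "Q8_0": 8.5, "Q6_K": 6.5625, "Q4_K_M": 4.85}

def approx_size_gb(n_params: float, quant: str) -> float:
    """Approximate model size: params * bits-per-weight / 8 bits-per-byte."""
    return n_params * BPW[quant] / 8 / 1e9

for q in ("fp16", "Q8_0", "Q6_K", "Q4_K_M"):
    print(f"{q}: ~{approx_size_gb(8e9, q):.1f} GB")
```

On this estimate Q6_K roughly halves the fp16 footprint of an 8B model while staying above 6 bits per weight, which is consistent with the observation that it degrades noticeably less than 4-bit quants but still trails Q8_0.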
Hermes 2 Theta Llama-3 8B | Llama 3 + Hermes 2 Pro merge | by …
This model surpasses both Hermes 2 Pro and Llama-3 Instruct on almost all benchmarks tested, retains its function calling capabilities, and in all our testing, achieves the best of both worlds …
Llama 3 hardware recommendation help : r/LocalLLaMA - Reddit
Apr 30, 2024 · One of our company directors has decided we need to go 'All in on AI'. I have been tasked with estimating the requirements for purchasing a server to run Llama 3 70b for around …
⬛ LLM Comparison/Test: Llama 3 Instruct 70B + 8B ... - Reddit
Llama 3 rocks! Llama 3 70B Instruct, when run with sufficient quantization (4-bit or higher), is one of the best - if not the best - local models currently available.
4-bit pre-quantized Llama-3 8B bitsandbytes uploaded!
Just uploaded 4-bit pre-quantized bitsandbytes versions (can do GGUF if people want) of Llama-3's 8B instruct and base models on Unsloth's HF page! https://huggingface.co/unsloth …
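For scale, a minimal sketch of the memory saving that a 4-bit (NF4) checkpoint like this gives over fp16. The 0.5 bytes/param figure is an assumption that treats every weight as 4-bit and ignores quantization constants and any layers kept in higher precision, so it slightly understates real usage:

```python
def weight_mem_gb(n_params: float, bytes_per_param: float) -> float:
    """Memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return n_params * bytes_per_param / 1e9

fp16_gb = weight_mem_gb(8e9, 2.0)  # -> 16.0 GB: needs a data-center GPU
nf4_gb = weight_mem_gb(8e9, 0.5)   # -> 4.0 GB: fits on an 8 GB consumer card
print(fp16_gb / nf4_gb)            # -> 4.0x smaller
```

That 4x reduction is the main point of pre-quantized uploads: the 8B model drops from data-center territory into the range of ordinary consumer GPUs, at some cost in output quality.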
Zuckerberg says they are training LLaMa 3 on 600,000 H100s
Jan 18, 2024 · They're not training LLaMA on 650k H100s. They're not looking to have 650k H100s, only 350k. They haven't mentioned how many or what GPUs they're training LLaMA-3 …