- Benjamin Marie, "NeuralChat 7B: Intel's Chat Model Trained with DPO" (Distilled DPO by Intel), Nov 22, 2023
- Andrew Zhu in Stackademic, "Load Up and Run Any 4-bit LLM Model Using Hugging Face Transformers" (Solve the 4-bit LLM setup problems all at once), Nov 21, 2023
- Benjamin Marie in TDS Archive, "Falcon 180B: Can It Run on Your Computer?" (Yes, if you have enough CPU RAM), Sep 12, 2023
- Benjamin Marie, "1-bit Quantization: Run Models with Trillions of Parameters on Your Computer" (More and more progress toward making huge LLMs more accessible), Nov 2, 2023
- Shashank Jain, "Running LLaMA 2 (13B) on Free Colab" (Running LLaMA 2 13B in FP16 needs around 26 GB of memory, which a free Colab GPU cannot provide), Jul 23, 2023
- Milind Deore in Dev Genius, "Transpose HuggingFace Models to GGUF Format" (Llama.cpp runs LLMs efficiently on both CPUs and GPUs, but the two frameworks use different file formats), Sep 23, 2023
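The ~26 GB figure in the LLaMA 2 13B entry above follows from simple arithmetic: each FP16 parameter occupies 2 bytes, so 13 billion parameters take roughly 26 GB for the weights alone (activations and the KV cache add more on top). A minimal sketch of that estimate, with a helper name of my own choosing:

```python
def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate memory needed just to store model weights, in GB."""
    return num_params * bytes_per_param / 1e9

# FP16 stores each parameter in 2 bytes.
fp16_gb = weight_memory_gb(13e9, 2)    # -> 26.0

# A 4-bit quantized model stores each parameter in 0.5 bytes,
# which is why 4-bit LLMs fit on much smaller GPUs.
int4_gb = weight_memory_gb(13e9, 0.5)  # -> 6.5

print(f"FP16:  ~{fp16_gb:.1f} GB")
print(f"4-bit: ~{int4_gb:.1f} GB")
```

The same back-of-the-envelope calculation explains the other entries in the list: 4-bit quantization cuts the weight footprint by 4x relative to FP16, and a 180B model like Falcon 180B needs hundreds of GB even quantized, hence the reliance on CPU RAM.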