Show HN: finetune LLMs via the Finetuning Hub https://ift.tt/A3w51Vq

September 04, 2023

Show HN: finetune LLMs via the Finetuning Hub https://ift.tt/A3w51Vq

Show HN: finetune LLMs via the Finetuning Hub Hi HN community, I have been working on benchmarking publicly available LLMs these past couple of weeks. More precisely, I am interested on the finetuning piece since a lot of businesses are starting to entertain the idea of self-hosting LLMs trained on their proprietary data rather than relying on third party APIs. To this point, I am tracking the following 4 pillars of evaluation that businesses are typically look into: - Performance - Time to train an LLM - Cost to train an LLM - Inference (throughput / latency / cost per token) For each LLM, my aim is to benchmark them for popular tasks, i.e., classification and summarization. Moreover, I would like to compare them against each other. So far, I have benchmarked Flan-T5-Large, Falcon-7B and RedPajama and have found them to be very efficient in low-data situations, i.e., when there are very few annotated samples. Llama2-7B/13B and Writer’s Palmyra are in the pipeline. But there’s so many LLMs out there! In case this work interests you, would be great to join forces. GitHub repo attached — feedback is always welcome :) Happy hacking! https://ift.tt/ODaQ5iU September 4, 2023 at 08:46PM

Search This Blog

Hd mp4, Hollywood DVDRip Latest movies Bollywood Dual Audio,

Show HN: finetune LLMs via the Finetuning Hub https://ift.tt/A3w51Vq

Comments

Post a Comment

Popular Posts

Show HN: Computer Engineering for Babies (Book) https://t.co/JVBVS9tf7y Show HN: Computer Engineering for Babies (Book) https://t.co/flag31aVvy August 31, 2021 at 12:32AM https://t.co/rQFjtIJb9c

Show HN: Prompteus – Visual workflow builder for shipping better AI features https://ift.tt/G0cQ649