daVinci-LLM-3B
Overview
daVinci-LLM-3B is a 3-billion-parameter base language model built to make pretraining transparent and reproducible. The release includes not only the weights but also training trajectories, intermediate checkpoints, data-processing decisions, and more than 200 ablation studies.
