[P] Higgsfield.AI – Anyone can train Llama 70B or Mistral for free

higgsfield_ai@alien.top · 10 months ago

[P] Higgsfield.AI – Anyone can train Llama 70B or Mistral for free

TotesMessenger@alien.top · 10 months ago

I’m a bot, bleep, bloop. Someone has linked to this thread from another place on reddit:

[/r/datascienceproject] Higgsfield.AI – Anyone can train Llama 70B or Mistral for free (r/MachineLearning)

^(If you follow any of the above links, please respect the rules of reddit and don’t vote in the other threads.) ^(Info ^/ ^[1](/message/compose?to=/r/TotesMessenger))

Contact ↩︎

0zyman23@alien.top · 10 months ago

Wow, you guys are the best, could you also add estimated time for my run to start, thinking if i ll get something in meaningful time, but the mere fact things like this exist is great

ginger_turmeric@alien.top · 10 months ago

Do you allow training of other sorts of models? I want to train a TTS model.

higgsfield_ai@alien.top · 10 months ago

We support only large models (starting from 7B).

badabummbadabing@alien.top · 10 months ago

By ‘training’, I assume you mean fine-tuning or LoRA?

higgsfield_ai@alien.top · 10 months ago

We only do full fine-tune.

light24bulbs@alien.top · 10 months ago

Are you having good luck with adding knowledge to the model? I tried this with llama for a couple weeks when things were just getting going and I just could not find good hyperparameters for fine tuning. I was also doing Lora so…idk.

higgsfield_ai@alien.top · 10 months ago

From our experience, to get a very good results you need

High quality dataset. It’s worth to spend more time on data cleaning. It’s way better to have a smaller dataset with high quality points than a huge dataset with garbage.
You need to fully finetune it.

Thistleknot@alien.top · 10 months ago

Same

yashdes@alien.top · 10 months ago

Don’t leave us hanging, what does the cluster look like? (ignore if you’re not allowed to share, but I’m a gigantic hardware nerd)

0zyman23@alien.top · 10 months ago

In terms of their capacity nothing crazy, Its probably a standard H100 or A100 cluster, 32 or 64 gpus

MrEloi@alien.top · 10 months ago

Why are you hiding who you are, and how many GPUs you have … and if you have legal access to them?

kalakau@alien.top · 10 months ago

What’s with the tendency for software engineers to name their libraries after fundamental physics? As a physicist this always bothered me. I’ll search for numerical algorithms for doing real physics… and end up with some garbage blockchain app or a Rust crate that does nothing

0zyman23@alien.top · 10 months ago

Giving their gpu for free - this is some iq 200 stuff