Programming@programming.dev · 1 year ago

Mistral 7B AI Model Released Under Apache 2.0 License

mistral.ai

cross-posted to:
localllama@sh.itjust.works

Mistral 7B AI Model Released Under Apache 2.0 License

mistral.ai

e0qdk@kbin.social to

Programming@programming.dev · 1 year ago

cross-posted to:
localllama@sh.itjust.works

Mistral 7B

mistral.ai

The best 7B model to date, Apache 2.0

Description from the site:

Mistral AI team is proud to release Mistral 7B, the most powerful language model for its size to date.
Mistral 7B in short

Mistral 7B is a 7.3B parameter model that:

    Outperforms Llama 2 13B on all benchmarks
    Outperforms Llama 1 34B on many benchmarks
    Approaches CodeLlama 7B performance on code, while remaining good at English tasks
    Uses Grouped-query attention (GQA) for faster inference
    Uses Sliding Window Attention (SWA) to handle longer sequences at smaller cost

We’re releasing Mistral 7B under the Apache 2.0 license, it can be used without restrictions.

    Download it and use it anywhere (including locally) with our reference implementation
    Deploy it on any cloud (AWS/GCP/Azure), using vLLM inference server and skypilot
    Use it on HuggingFace

Mistral 7B is easy to fine-tune on any task. As a demonstration, we’re providing a model fine-tuned for chat, which outperforms Llama 2 13B chat.

Chat

Sigmatics@lemmy.ca
link
fedilink
arrow-up
1·
edit-2
1 year ago
That would mean 16GB are required to run this one

Programming@programming.dev

programming@programming.dev

Create a post

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !programming@programming.dev

Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!

Cross posting is strongly encouraged in the instance. If you feel your post or another person’s post makes sense in another community cross post into it.

Hope you enjoy the instance!

Rules

Follow the programming.dev instance rules
Keep content related to programming in some way
If you’re posting long videos try to add in some form of tldr for those who don’t want to watch videos

Wormhole

Follow the wormhole through a path of communities !webdev@programming.dev

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

161 users / day
1.33K users / week
3.11K users / month
9.75K users / 6 months
22 local subscribers
17.4K subscribers
1.86K Posts
28.5K Comments
Modlog