Jeffrey Morgan

@jmorgan

Followers: 4,269
Following: 129
Media: 13
Statuses: 1,537
@jmorgan
Jeffrey Morgan
11 months
sqlcoder is an open-source LLM that converts natural language to high-quality SQL queries, making it easy to query data for even complex schemas and questions. Run it locally with Ollama:
@jmorgan
Jeffrey Morgan
5 months
Mixtral 8x22B running on a MacBook Pro with Ollama. Works with the latest pre-release version, 0.1.32, and will be published soon.
@jmorgan
Jeffrey Morgan
8 months
TinyLlama is a 1.1B model with the Llama 2 architecture, trained on 3 trillion tokens. Its small size means it runs fast with low memory and compute requirements.
@jmorgan
Jeffrey Morgan
10 months
Neural Chat is a new model based on Mistral and fine-tuned by Intel. It's currently the highest-ranked 7B model on the HuggingFace H4 open-source LLM leaderboard.
@jmorgan
Jeffrey Morgan
2 months
Ollama 0.2 can now:
* Run different models side-by-side
* Process multiple requests in parallel
This enables a whole new set of RAG, agent, and model serving use cases. Ollama will automatically load and unload models dynamically based on how much memory is in the system.
@ollama
ollama
2 months
Ollama 0.2 is here! Concurrency is now enabled by default. This unlocks 2 major features:
Parallel requests: Ollama can now serve multiple requests at the same time, using only a little bit of additional memory for each request. This enables use cases…
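The parallel serving described above can be sketched against Ollama's REST generate endpoint. This is a minimal illustration, assuming a local server on the default port 11434 with both example models already pulled; the model names and prompts are placeholders:

```python
import json
import urllib.request
from concurrent.futures import ThreadPoolExecutor

# Default local Ollama endpoint; adjust if your server runs elsewhere.
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_request(model, prompt):
    # Each request names its own model; since 0.2 Ollama can keep
    # several models loaded side by side and answer them in parallel.
    return {"model": model, "prompt": prompt, "stream": False}

def send(req):
    # Blocking HTTP POST; run several of these in threads for parallelism.
    data = json.dumps(req).encode()
    with urllib.request.urlopen(OLLAMA_URL, data=data) as resp:
        return json.loads(resp.read())["response"]

def ask_in_parallel(prompts_by_model):
    # Fan requests out across a thread pool; the server handles them concurrently.
    reqs = [build_request(m, p) for m, p in prompts_by_model]
    with ThreadPoolExecutor(max_workers=len(reqs)) as pool:
        return list(pool.map(send, reqs))

# Example (requires a running server with both models pulled):
# answers = ask_in_parallel([("llama3", "Hi"), ("mistral", "Hi")])
```

With concurrency on by default, the same pattern works for serving multiple users from one Ollama instance.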
@jmorgan
Jeffrey Morgan
11 months
zephyr is a 7B model created by the HuggingFace H4 team. It's a fine-tuned version of Mistral 7B that beats Llama 2 70B Chat on a series of benchmarks. Built-in alignment layers were removed to improve results.
@jmorgan
Jeffrey Morgan
6 months
Run Mistral's new base text completion model updated to v0.2 with Ollama: ollama run mistral:text
@jmorgan
Jeffrey Morgan
11 months
Docker + Ollama. Deploy and run LLMs such as Llama 2 and Mistral in Docker using Ollama. Chat with models locally in containers, and export a port to serve models over a REST API. GPU acceleration is built in, with both Intel and Arm image versions available.
@jmorgan
Jeffrey Morgan
5 months
Dolphin 2.9 Llama 3 is a new version of the popular Dolphin model by @erhartford, fine-tuned from Llama 3 8B: ollama run dolphin-llama3
@jmorgan
Jeffrey Morgan
9 months
Ollama now supports multimodal models such as LLaVA by @imhaotian
@jmorgan
Jeffrey Morgan
8 months
Notux is a new 8x7B mixture of experts (MoE) model by @argilla_io, created by fine-tuning Mixtral on a high-quality dataset. It's currently the top-performing mixture of experts model on the Hugging Face Open LLM Leaderboard!
@jmorgan
Jeffrey Morgan
9 months
Starling is a new 7 billion parameter large language model by @BanghuaZ & team. This model outperforms every model to date except GPT-4 and GPT-4 Turbo on MT-Bench, a benchmark that assesses chatbot helpfulness.
@jmorgan
Jeffrey Morgan
5 months
CodeGemma is a new collection of 2B and 7B models by Google that specialize in coding tasks:
* Fill-in-the-middle code completion
* Code generation
* Mathematical reasoning
* Instruction following
@jmorgan
Jeffrey Morgan
9 months
Dolphin 2.6 Phi-2 is a new uncensored chat model by @erhartford. Since this version of Dolphin is based on the small 2.7B Phi model by Microsoft Research, it's fast and can run on a wide range of machines using @ollama.
@jmorgan
Jeffrey Morgan
10 months
codebooga is a new 34 billion parameter code model created by the infamous Oobabooga. It's a merge of two other popular coding models and has been praised by model creator @erhartford for being the best code instruct model next to GPT-4.
@jmorgan
Jeffrey Morgan
11 months
Dolphin 2.1 Mistral is @erhartford's most recent 7B instruct-tuned model, based on the popular Mistral foundation model. It's currently one of the highest-performing models on the open-source LLM leaderboards.
@jmorgan
Jeffrey Morgan
9 months
Dolphin 2.5 Mixtral 8x7b is a new uncensored model by @erhartford based on Mixtral, the new mixture of experts model by @MistralAI. It was trained on a wide variety of datasets and is especially strong at coding tasks.
@jmorgan
Jeffrey Morgan
10 months
Orca 2 is a new small model released by Microsoft Research. It has enhanced reasoning abilities typically found only in language models 5-10x larger.
@jmorgan
Jeffrey Morgan
4 months
Fully-local AI Town! AI Town is a virtual town where characters live and interact. It has wonderful visuals, is highly customizable, and can now run entirely on your local machine!
@ianmacartney
Ian Macartney
4 months
Run an AI Town locally, powered by llama3 🎉 No cloud signups needed. Make your own world, and then talk to it :) Runs the open-source @convex_dev backend locally. Use @ollama locally or @togethercompute for cloud LLM. @realaitown
@jmorgan
Jeffrey Morgan
7 months
LLaVA 1.6 from @imhaotian has been released with improved resolution support, visual reasoning, and OCR capabilities, all while maintaining minimalist design and data efficiency.
@jmorgan
Jeffrey Morgan
9 months
Mixtral can now be run with @ollama
@ollama
ollama
9 months
Ollama countdown: Day 11. Mixture of experts models are now supported in v0.1.16!
Mixtral 8x7B: ollama run mixtral
Dolphin Mixtral: ollama run dolphin-mixtral
Dolphin Mixtral is an uncensored, fine-tuned model created by @erhartford! (48GB+ memory required)
@jmorgan
Jeffrey Morgan
8 months
The Ollama Python & JavaScript libraries are now available: Python: JavaScript:
@ollama
ollama
8 months
Ollama Python & JavaScript libraries are here! Both libraries make it possible to integrate new and existing apps with Ollama in a few lines of code, and share the features and feel of the Ollama REST API. Learn more:
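The Python library's chat interface can be sketched as follows. This is a minimal example, assuming the official `ollama` package is installed (`pip install ollama`) and a local server is running with the model pulled; `llama3` here is a placeholder model name:

```python
# Building the message list needs nothing installed; the chat call
# requires the `ollama` package and a running local server.

def build_messages(prompt, system=None):
    # Messages use the same role/content shape as the Ollama REST API.
    msgs = []
    if system:
        msgs.append({"role": "system", "content": system})
    msgs.append({"role": "user", "content": prompt})
    return msgs

def ask(prompt, model="llama3"):
    import ollama  # imported lazily so the helper above works without it
    resp = ollama.chat(model=model, messages=build_messages(prompt))
    return resp["message"]["content"]

# ask("Why is the sky blue?")  # returns a model-generated answer
```

The JavaScript library mirrors the same request shape, so code moves between the two with little change.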
@jmorgan
Jeffrey Morgan
7 months
Stable LM 2 1.6B is a new small language model with competitive performance, matching and even surpassing significantly larger models.
@jmorgan
Jeffrey Morgan
1 month
Ollama now supports tools!
@ollama
ollama
1 month
Ollama 0.3 with tool support! You can now use tool calling with popular models such as Llama 3.1! 👇
Example tools include:
- Functions & APIs
- Web browsing
- Code interpreter
- and much more!
🧵 quick thread
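Tool calling can be sketched as below. The tool itself (`get_weather`) is hypothetical; the schema shape is the function-style schema that Ollama's chat API accepts in its `tools` parameter, and a running server with a tool-capable model is assumed for the commented-out call:

```python
def make_tool(name, description, properties, required):
    # Wrap a JSON-schema parameter spec in the function-tool envelope.
    return {
        "type": "function",
        "function": {
            "name": name,
            "description": description,
            "parameters": {
                "type": "object",
                "properties": properties,
                "required": required,
            },
        },
    }

# Hypothetical example tool: look up the weather for a city.
weather_tool = make_tool(
    "get_weather",
    "Get the current weather for a city",
    {"city": {"type": "string", "description": "City name"}},
    ["city"],
)

# With a running server and a tool-capable model such as Llama 3.1:
# import ollama
# resp = ollama.chat(model="llama3.1",
#                    messages=[{"role": "user", "content": "Weather in Paris?"}],
#                    tools=[weather_tool])
# resp["message"].get("tool_calls") then names the function to invoke
# with its arguments; your code runs it and feeds the result back.
```

The model decides when to call the tool; the application remains responsible for executing it and returning the result in a follow-up message.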
@jmorgan
Jeffrey Morgan
5 months
Running Mixtral 8x22B Instruct on an M3 MacBook Pro
@ollama
ollama
5 months
@MistralAI's Mixtral 8x22B Instruct is now available on Ollama! ollama run mixtral:8x22b
We've updated the tags to reflect the instruct model by default. If you have pulled the base model, please update it by performing an `ollama pull` command.
@jmorgan
Jeffrey Morgan
9 months
Phi-2 is a new 2.7B small language model from Microsoft Research. Trained on a highly curated dataset of 1.4 trillion tokens, this model's performance surpasses several 13B models in common sense reasoning and language understanding.
@jmorgan
Jeffrey Morgan
10 months
Zephyr 7B Beta is the second model in the Zephyr series. Also based on Mistral, this iteration is fine-tuned on a distilled dataset making it even better at chat use cases. In many cases it can provide better responses than the much larger Llama 2 70B.
@jmorgan
Jeffrey Morgan
5 months
Stable Code 3B has been updated to a new instruct version, making it possible to use in conversations. Performance remains on par with larger models such as Code Llama 7B:
@jmorgan
Jeffrey Morgan
5 months
@erhartford @CrusoeCloud @LucasAtkins7 @FernandoNetoAi Amazing! Congrats on the release. Added to Ollama: ollama run dolphin-llama3
@jmorgan
Jeffrey Morgan
8 months
TinyDolphin is a fun, new 1.1B parameter model trained by @erhartford and based on @PY_Z001's fantastic TinyLlama project.
@jmorgan
Jeffrey Morgan
5 months
Dolphin Mistral 2.8 is a new version of Dolphin Mistral by the amazing @erhartford, fine-tuned from the recent Mistral 0.2 model with support for a context window of up to 32k tokens.
@jmorgan
Jeffrey Morgan
9 months
StableLM Zephyr is a new chat model by Stability AI: the first of its type with the efficient size of only 3B parameters. It has strong capabilities in generating contextually relevant, coherent, and linguistically accurate text.
@jmorgan
Jeffrey Morgan
9 months
Nous Hermes 2 is the latest model by @Teknium1 and @NousResearch. Based on the Yi 34B model, it's the highest-performing in their "Hermes" series and excels in scientific discussion and coding tasks.
@jmorgan
Jeffrey Morgan
9 months
Solar is a new 10.7B model by @upstageai that performs exceedingly well for single-turn instruct use cases.
@jmorgan
Jeffrey Morgan
7 months
ollama run llava:34b
>>> What's in this picture? ./img_0064.jpg
This is an image of graffiti on a wall. The graffiti features a blue whale with white fins, and the words "BUILD SHIP RUN LOVE" are written in white above it.
@jmorgan
Jeffrey Morgan
1 month
A new 2B parameter model by the Google Gemma team!
@ollama
ollama
1 month
Open-source continues! @Google's Gemma 2 has a new size! A 2B parameter model joins the 9B and 27B models. ollama run gemma2:2b
@jmorgan
Jeffrey Morgan
3 months
Run @MistralAI's newest model, Codestral. It's their first code generation model!
@ollama
ollama
3 months
ollama run codestral
@jmorgan
Jeffrey Morgan
5 months
Llama 3 feels much less censored than Llama 2: its false refusal rate is less than a third of Llama 2's, and it's willing to discuss topics its predecessor wouldn't! After seeing a post here by @erhartford, we tested a few prompts:
@ollama
ollama
5 months
We tried manually prompting llama 3 to see how it fares against llama 2 on many basic safety questions. It feels significantly less censored! So much better! 👀 quick read: 👇
@jmorgan
Jeffrey Morgan
9 months
Build an open-source RAG app using LlamaIndex + Mixtral + Ollama 🚀
@llama_index
LlamaIndex 🦙
9 months
Running @MistralAI's Mixtral 8x7b on your laptop is now a one-liner! Check out this post in which we show you how to use @OLLAMA with LlamaIndex to create a completely local, open-source retrieval-augmented generation app complete with an API. Bonus: see…
@jmorgan
Jeffrey Morgan
5 months
AI Inference now available in Supabase Edge Functions powered by @ollama
@supabase
Supabase
5 months
AI Inference now available in Supabase Edge Functions.
@jmorgan
Jeffrey Morgan
5 months
@AutoGPTunofish No speedup. M3 Max with 128GB unified memory.
@jmorgan
Jeffrey Morgan
5 months
@steipete I'm sorry this happened. It might have hit the context window limit, in which case it will try to free up some context window – this works for most models but definitely not well for Llama 3. I've seen it too and am working on fixing it so it doesn't happen. In the meantime you…
@jmorgan
Jeffrey Morgan
11 months
Fast model downloads 🚀 Running local models involves pulling GBs of model weights, usually leading to a long wait time before a response. Ollama 0.1.3 will now pull models in several parts simultaneously, significantly reducing the time required to go from a new machine to a…
@jmorgan
Jeffrey Morgan
1 year
Ollama on Linux Run open-source LLMs with GPU acceleration out of the box on Linux.
@ollama
ollama
1 year
🙌 Ollama for Linux is here! 🙌 👀 Nvidia GPU support now comes out of the box 💪 WSL2 will work with Ollama + Nvidia GPU acceleration. 👨‍💻 Ollama works on cloud servers! Try it: 👇
@jmorgan
Jeffrey Morgan
1 year
Ollama can now be used to generate embeddings using LangChain. What's great is the same in-memory model will be shared for generating both embeddings and completions. This means for larger models: faster performance!
@Hacubu
Jacob Lee
1 year
🦙 Ollama Local Embeddings 🦙 Search with local Llama-2 embeddings in @LangChainAI JS/TS 🦜🔗 0.0.146. GPU-boosted in minutes with @jmorgan 's ! Incredible what you can do on a Macbook these days. Thank you GH user Basti-an!
@jmorgan
Jeffrey Morgan
7 months
Gemma is a new family of open models by Google offering best-in-class performance by size: 2B and 7B parameter models are available.
@jmorgan
Jeffrey Morgan
1 year
LangChain + Ollama! Build apps that run entirely locally with Llama 2. Great for a ton of use cases like answering questions from local, private documents (see the example @Hacubu shared below) or building apps without incurring costs from cloud-hosted LLMs.
@Hacubu
Jacob Lee
1 year
🚨🏠Set up all-local, JS-based retrieval + QA over docs in 5 minutes in @LangChainAI JS/TS 0.0.124! @TensorFlow JS embeddings, HNSWLib vector store, and the last missing piece: GPU-optimized 🦙Llama 2 w/ @jmorgan 's Use case + 🧵:
@jmorgan
Jeffrey Morgan
1 year
This morning Mistral AI released a new, best-in-class 7B model with both chat and text completion variations. It's now on Ollama!
@jmorgan
Jeffrey Morgan
8 months
Stable Code 3B is a new 3B model with code completion results on par with Code Llama 7B. It supports fill-in-the-middle (FIM) capability, making it a great model to use for code completion tools.
@ollama
ollama
8 months
ollama run stable-code
Try Stability AI's Stable Code 3B model. Learn more:
Thank you @StabilityAI @ncooper57 @EMostaque and team for creating the model!
@jmorgan
Jeffrey Morgan
10 months
Dolphin 2.2 Mistral: a new uncensored model by @erhartford that is enhanced with additional data from the Airoboros and Samantha projects. Without assuming an identity of its own, this model is more empathetic and better handles long conversations.
@jmorgan
Jeffrey Morgan
11 months
An alternative to GitHub Copilot focused on privacy that works with Llama 2, Code Llama, and Mistral.
@dani_avila7
Daniel San
11 months
This isn't Copilot 😱! This is @codegptAI with your own llama, codellama or mistral model running locally, with absolute privacy for your code. Available from version 2.1.28 onwards 🤩 Powered by @LangChainAI and @Ollama_ai. Download the extension here:
@jmorgan
Jeffrey Morgan
6 months
Can't wait to see everyone in Paris at the next meetup. It's going to be the best one yet!
@ollama
ollama
6 months
Bonjour Ollama! Friends and Ollama are heading to Paris. If you are in the area, please join us for a developer meetup on Thursday, March 21st 6pm at Station F!
@jmorgan
Jeffrey Morgan
11 months
A new cookbook for running SQL queries with open-source LLMs, all running locally!
@LangChainAI
LangChain
11 months
⭐️ Private chat w/ SQL using LLaMA2 ⭐️ LLMs can serve as a natural language interface to structured data in SQL DBs. But many text-to-SQL prompts rely on database schema/tables, e.g.: Open-source LLMs, like LLaMA2, are a great way to unlock LLM+SQL…
@jmorgan
Jeffrey Morgan
9 months
DeepSeek LLM is a new language model by @deepseek_ai, available in 7B and 67B parameter counts. This model has strong results in coding & math, and is trained on over 2 trillion bilingual tokens, making it effective in both English and Chinese.
@jmorgan
Jeffrey Morgan
5 months
⚡️ + 🦙
@kiwicopple
Paul Copplestone — e/postgres
5 months
Today we're adding native AI support in @supabase Edge Functions:
◆ Embedding models
◆ Large language models (powered by @ollama)
We've removed the cold boot by placing the models inside the edge runtime and we're rolling out a GPU-powered sidecar. See it in action:
@jmorgan
Jeffrey Morgan
5 months
Scale up LLMs with @ollama + @skypilot_org:
@skypilot_org
SkyPilot
5 months
💫 Scale quantized LLMs on your cloud/k8s with Ollama and SkyPilot! 📖 Use @ollama to run Mistral, Llama2-7B with just 4 CPUs and scale it up with SkyServe. Add GPUs w/ 1 line to make it faster! 💻 No cloud access, no problem - runs on your laptop too!
@jmorgan
Jeffrey Morgan
8 months
@rastadidi @erhartford @Magicoder_AI @zraytam It's updated to 2.6! I'll be combining the other two into this link as well, so all versions are in one place here: 😃
@jmorgan
Jeffrey Morgan
11 months
Run open-source LLMs on @flydotio with @Ollama_ai
1. Download Ollama:
2. Run a model remotely on Fly: OLLAMA_HOST= ollama run mistral
This is a demo instance, but sign up for the Fly GPU waitlist for your own!
@flydotio
Fly.io
11 months
Ollama + GPUs!! Thanks @Ollama_ai
@jmorgan
Jeffrey Morgan
11 months
sqlcoder is now available in a 7B parameter version, which will run on smaller devices (e.g. MacBooks with 8GB of memory): ollama run sqlcoder:7b
@jmorgan
Jeffrey Morgan
11 months
sqlcoder is an open-source LLM that converts natural language to high-quality SQL queries, making it easy to query data for even complex schemas and questions Run it locally with Ollama:
@jmorgan
Jeffrey Morgan
3 months
@rohanpaul_ai Looking into this case and will fix it. The OP didn’t share hardware or model examples but let me know if you see a speed discrepancy.
@jmorgan
Jeffrey Morgan
11 months
OpenHermes 2 Mistral 7B by @Teknium1: a new fine-tuned model based on Mistral, trained on open datasets totalling over 900,000 instructions. This model has strong multi-turn chat skills, surpassing previous Hermes 13B models and even matching 70B models on some benchmarks.
@jmorgan
Jeffrey Morgan
1 year
Use Ollama as a chat model with LangChain! Amazing to see this come together @RLanceMartin @Hacubu
@Hacubu
Jacob Lee
1 year
🦙 Ollama Local Chat Models 🦙 New in @LangChainAI : call local OSS models as chat models with @jmorgan ’s ! Llama 2 13B gets 50 tok/s on an M2 Mac with < 5 minutes of setup. S/o @RLanceMartin ! 🐍: ☕:
@jmorgan
Jeffrey Morgan
10 months
Build a local, open-source version of ChatGPT with Mistral, Llama 2 and other open-source models.
@MatthewBerman
MatthewBerman
10 months
Want to build your own local, open-source ChatGPT? It's easy with @Ollama_ai! Plus, you can run MANY models in parallel without spending a ton of $$ on GPUs. Here's how:
@jmorgan
Jeffrey Morgan
9 months
LLM-powered Tamagotchi that all runs locally 🐣
@stuffyokodraws
Yoko
9 months
[NEW LAUNCH] 0/ ✨AI-tamago🐣: A local-ready LLM-generated and LLM-driven tamagotchi with thoughts and feelings. 100% Javascript and costs $0 to run. 🧵 Stack: - 🎮 Game state & reliable AI calls: @inngest - 🦙 Inference: @Ollama_ai (local), @OpenAI ,
@jmorgan
Jeffrey Morgan
10 months
@pdev110
Patrick Devine
10 months
@steveeichert @Ollama_ai We've been working on getting multi-modal support working. Here's a quick demo:
@jmorgan
Jeffrey Morgan
5 months
@__thetaphipsi @steipete Should be fixed now! Need to re-pull the model: ollama pull llama3:70b (or the model you used – e.g. ollama pull llama3) Note: the download should be instantaneous since it's a small change to the runtime parameters. Sorry again this happened!!
@jmorgan
Jeffrey Morgan
1 year
As of version 0.0.19, Ollama supports running Falcon 180B with a single command: ollama run falcon:180b
The smaller siblings work too: falcon:7b and falcon:40b
@jmorgan
Jeffrey Morgan
11 months
7B models that punch well above their weight class are becoming increasingly popular – they run quite fast locally!
@jmorgan
Jeffrey Morgan
11 months
@rishdotblog @dharmesh @defogdata @mchiang0610 The 7b version of sqlcoder is available on Ollama: ollama run sqlcoder:7b GPU accelerated on Linux & Mac (Windows coming soon :-) There's an API as well for building apps (incl Mac apps!)
@jmorgan
Jeffrey Morgan
5 months
@colhountech @ollama Sorry about this. Working on a fix. It seems to be from hitting the context length of the model. It can be increased by running `/set parameter num_ctx 8192` in an `ollama run` session. If you're using the API, it's the `num_ctx` option.
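The API-side fix described in the reply can be sketched as a small helper that adds `num_ctx` to a generate request's `options`. This is an illustration; the model name and prompt are placeholders, and a running local server is assumed for the actual POST:

```python
# Equivalent of `/set parameter num_ctx 8192` in `ollama run`, but for
# the REST API: the context window goes in the request's `options` field.

def with_num_ctx(payload, num_ctx=8192):
    # Returns a copy so the original request dict is untouched;
    # any existing options (e.g. temperature) are preserved.
    out = dict(payload)
    out["options"] = {**out.get("options", {}), "num_ctx": num_ctx}
    return out

req = with_num_ctx({"model": "llama3", "prompt": "Summarize this long document ..."})
# POST req as JSON to http://localhost:11434/api/generate on a running server.
```

Raising `num_ctx` increases memory use, so it's worth setting only as high as the workload needs.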
@jmorgan
Jeffrey Morgan
11 months
Chat with local LLMs using a friendly user interface on macOS. @itsKGhandour's Ollama SwiftUI is a native macOS interface for downloading and chatting with LLMs, written in Swift and open source.
@itsKGhandour
Karim ElGhandour
11 months
After getting exposed to @Ollama_ai, I decided to learn Swift over the past week and create a native user-friendly interface to be able to quickly chat with LLMs. It is now public, open source and waiting for your feedback! Check it out now!
@jmorgan
Jeffrey Morgan
1 year
Ollama + Tailscale = A personal LLM that you can connect to from anywhere. Love the design @andybons 🤩
@jmorgan
Jeffrey Morgan
10 months
KubeCon x Ollama 🚀
@SaiyamPathak
Saiyam Pathak
10 months
Dynamic resource allocation gets a mention: this is Ollama on a kind cluster!
@jmorgan
Jeffrey Morgan
1 year
🦙+ 🐧
@mchiang0610
Michael
1 year
I'm so excited about this one... LET'S GO!!!
@jmorgan
Jeffrey Morgan
1 year
Connect dozens of data connectors – Obsidian, databases, APIs and more – to Ollama with the new LlamaIndex integration ⭐️.
@llama_index
LlamaIndex 🦙
1 year
We’re now integrated with ( @jmorgan ) 🦙🎉 Our favorite part is the simplicity; do `ollama pull` and `ollama run` to run Llama 2, Code Llama, or other open LLMs. Easily plug it into the @llama_index RAG pipeline. Thanks @husjerry1! 🙌
@jmorgan
Jeffrey Morgan
11 months
Great seeing you 😊
@sqs
Quinn Slack
11 months
Met up with @jmorgan at SFO to chat about @Ollama_ai, building Cody on it, speeding up local inference, etc.
@jmorgan
Jeffrey Morgan
1 year
LiteLLM: a lightweight Python package for interacting with a variety of LLMs like OpenAI and Anthropic, taking care of input & output translation. Now you can use LiteLLM with Ollama models too:
@ishaan_jaff
Ishaan
1 year
💥 @LiteLLM x Ollama integration live now 🦙 Call your local llama2 models using ChatGPT input/output. Use our proxy to get caching, logging, error handling, 50+ LLM APIs (OpenAI, Azure, Anthropic). Try it here: We love the work being done by @jmorgan on this
@jmorgan
Jeffrey Morgan
11 months
Great talk and thread on accessing LLMs from the browser by @Hacubu
@Hacubu
Jacob Lee
11 months
Local LLMs are incredible, but their current reach is just engineers. How do we change that? I gave a @GoogleAI WebML Summit talk on how e.g. @LangChainAI & @ollama_ai enable client-side AI in web apps. The punchline: we need a new browser API! (1/9)
@jmorgan
Jeffrey Morgan
8 months
@visheratin @erhartford Yes! It works today! Check out Congrats on getting this model to such an impressive size. How similar is this to the previous Llava 7b/13b models?
@jmorgan
Jeffrey Morgan
1 year
Ollama in 🇨🇱
@dani_avila7
Daniel San
1 year
@jmorgan
Jeffrey Morgan
3 months
@erhartford @bartowski1182 Sorry you hit this; it will be fixed in the next release of Ollama. The pretokenizer change wasn’t backward compatible, and new converts may not run on older versions of llama.cpp.