Accelerate Generative AI Inference on Amazon SageMaker AI with G7e Instances
Today, we are thrilled to announce the availability of G7e instances powered by NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs on Amazon…
Run Llama 2 uncensored locally (August 1, 2023)
This post gives example comparisons between the uncensored Llama 2 model and its censored counterpart.
Run Code Llama locally (August 24, 2023)
Meta's Code Llama is now available on Ollama to try.
How to prompt Code Llama (September 9, 2023)
This guide walks through the ways to structure prompts for Code Llama and its variations and features, including instructions, code completion, and fill-in-the-middle (FIM).
Leveraging LLMs in your Obsidian Notes (September 21, 2023)
This post walks through how you could incorporate a local LLM using Ollama in Obsidian, or potentially any note-taking tool.
Ollama is now available as an official Docker image (October 5, 2023)
Ollama can now run with Docker Desktop on the Mac, and run inside Docker containers with GPU acceleration on Linux.
Building LLM-Powered Web Apps with Client-Side Technology (October 13, 2023)
Recreate one of the most popular LangChain use cases with open source, locally running software: a chain that performs Retrieval-Augmented Generation, or RAG for short, and allows you to “chat with your documents”.
Python & JavaScript Libraries (January 23, 2024)
The initial versions of the Ollama Python and JavaScript libraries are now available, making it easy to integrate your Python, JavaScript, or TypeScript app with Ollama in a few lines of code. Both libraries include all the features of the Ollama REST API, are familiar in design, and are compatible with new and previous versions of Ollama.
Vision models (February 2, 2024)
New vision models are now available: LLaVA 1.6, in 7B, 13B and 34B parameter sizes. These models support higher resolution images, improved text recognition and logical reasoning.
OpenAI compatibility (February 8, 2024)
Ollama now has initial compatibility with the OpenAI Chat Completions API, making it possible to use existing tooling built for OpenAI with local models via Ollama.
Windows preview (February 15, 2024)
Ollama is now available on Windows in preview, making it possible to pull, run and create large language models in a new native Windows experience. Ollama on Windows includes built-in GPU acceleration, access to the full model library, and serves the Ollama API including OpenAI compatibility.
Ollama now supports AMD graphics cards (March 14, 2024)
Ollama now supports AMD graphics cards in preview on Windows and Linux. All the features of Ollama can now be accelerated by AMD graphics cards on both platforms.