Running the Phi-2 LLM on Ollama!
Large Language Models (LLMs) are fascinating, and their benefits are obvious and widely discussed all over the internet. However, it’s also crucial to understand the mechanics behind these models. This knowledge is not just interesting; it’s essential for security purposes and effective application development.
I’m not pretending to understand the inner workings of LLMs, but I’m convinced that playing around with and learning about the available tooling is a good step forward on our shared journey.
A good way to get started is with a few simple working examples.
Since I’m a big fan of containers and Kubernetes, let’s look for a container-based example.
An interesting project is https://ollama.ai/. It offers a Docker image we can leverage, along with good instructions.
So let’s get started. Once things are running, we basically have a container that can be interacted with via an API or the CLI. We’ll use the CLI for now.
docker run -d -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama
Take note that running these systems locally may require a fair amount of disk space and CPU.
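Because the command above maps port 11434, we can do a quick sanity check that the Ollama server inside the container is reachable before going further. This is just a small check; the exact response text may vary between Ollama versions.

# Quick sanity check: the Ollama server listens on the mapped port
curl http://localhost:11434/
# Expected: a short status message such as "Ollama is running"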
Once we have our container up and running, we need to download and run the LLM of our choice.
docker exec -it ollama…
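As a rough sketch of this step, based on Ollama’s documented CLI: pulling and starting a model is done with ollama run inside the running container. I’m assuming here that Phi-2 is published under the phi tag in the Ollama model library, so double-check the exact name. And since the container also exposes an HTTP API on port 11434, as mentioned earlier, the same model can be prompted with a simple curl call once it has been pulled.

# Pull and start an interactive Phi-2 session inside the running container
# (the "phi" tag is an assumption; check the Ollama model library for the exact name)
docker exec -it ollama ollama run phi

# Alternatively, prompt the model through the HTTP API exposed on port 11434
curl http://localhost:11434/api/generate -d '{
  "model": "phi",
  "prompt": "Explain containers in one sentence.",
  "stream": false
}'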