DEV Community

# llm

Posts

How to get up & running a LLM locally - in 5 minutes

Comments
2 min read
Streamlining the Hiring Process with OpenAI and LangChain Part 1

Comments
7 min read
How I Built Graph based AI-Powered Search Engine, All Local

Comments
2 min read
FLaNK AI Weekly 18 March 2024

5
Comments
5 min read
Using large language models in software architecture

Comments
2 min read
pgvector vs. pgvecto.rs in 2024: A Comprehensive Comparison for Vector Search in PostgreSQL

10
Comments
7 min read
Getting Started with llama.cpp on Arch Linux!

1
Comments
2 min read
How to Run an LLM Locally with Pieces

1
Comments
7 min read
The Future of Natural Language APIs

Comments
2 min read
Navigating the Future with AI Copilots: A Comprehensive Guide

1
Comments
12 min read
AI agents will make the cloud cool again for developers

Comments
2 min read
Open Source Day 2024

Comments
5 min read
Screen resumes in minutes

Comments
1 min read
Supercharging LLM Training with Groq and LPUs

Comments
21 min read
Install AUTO-GPT on mac OS (march 2024)

Comments
2 min read
When was the last time you learn something from your LLM logs ? Here is the solution : phospho

Comments 3
2 min read
Building an Advanced Streamlit Chatbot with OpenAI Integration: A Comprehensive Guide - Part 3

1
Comments
14 min read
Large Language Models: Modern Gen4 LLM Overview (LLaMA, Pythia, PaLM2 and More)

Comments
12 min read
Deploy Mistral Large to Azure and create a conversation with Python and LangChain

3
Comments
5 min read
GenLearn - Your Personalized Learning Assistant!

7
Comments
5 min read
k8s-snap (Canonical Kubernetes) for simple, fast deployment of a k8s cluster …

Comments
4 min read
Top-Trending LLMs Over the Last Week

1
Comments
4 min read
LangChain Memory: Enhancing AI Conversational Capabilities

17
Comments
5 min read
Falcon 180B: Advancing Language Models in the AI Frontier

17
Comments
5 min read
Creating an AI BLOGGER with Lyzr, LlamaIndex, Perplexity, GPT4

Comments 1
1 min read
Gemini Function Calling

Comments
1 min read
Matryoshka Embeddings: The new kind of efficient embeddings

Comments
13 min read
FLaNK AI for 11 March 2024

6
Comments
6 min read
LLMs on your local Computer (Part 1)

Comments
20 min read
Workflow Integration with AI: A Unified Approach to Development

10
Comments
10 min read
Build a Copilot with PHI-2 Using Pieces Client

3
Comments
7 min read
What is RAG? A quick 101

9
Comments
3 min read
Running Local LLMs, CPU vs. GPU - a Quick Speed Test

31
Comments 15
3 min read
All About Google Gemma

Comments
2 min read
Limitations of Running AI Agents Locally

3
Comments
3 min read
Make the OpenAI Function Calling Work Better and Cheaper with a Two-Step Function Call 🚀

6
Comments 3
4 min read
Generate your docstrings automatically with zero-docs

1
Comments
2 min read
The Rise of the 1-Bit LLM

11
Comments 5
19 min read
Reduce your LLM costs by 10x using semantic caching

2
Comments
2 min read
(Easier) Root Cause Analysis of the Failure

2
Comments
2 min read
What are LLMs, Local LLMs and RAG?

1
Comments
7 min read
LLMs for Text-to-SQL problems: the benchmark vs real-world performance

2
Comments
8 min read
AI Runner Dev Update: Text-to-speech, LLM and Stable Diffusion interruptions

1
Comments
1 min read
Unleashing the Power of Similarity Search: Top 5 Vector Databases for AI Applications

17
Comments 1
3 min read
Real-time text to speech conversation about friends and Ray Bradbury with my computer

1
Comments
2 min read
Building a WhatsApp generative AI assistant with Amazon Bedrock and Python

6
Comments
8 min read
How does the Groq's LPU work?

Comments
7 min read
FLaNK 04 March 2024

6
Comments
6 min read
How to setup your own ChatGPT with OpenLLaMA

Comments
1 min read
Generative AI in QuickSight

Comments
2 min read
Large Language Models: Library Overview for Training, Fine-Tuning, Inference and More

5
Comments 1
6 min read
FLaNK Stack 29 Jan 2024

5
Comments
6 min read
Uncovering Generative Artificial Intelligence and LLMs: A Brief Introduction

Comments
2 min read
Prompt Engineering for OpenAI Chat Completions

Comments
7 min read
Boost Your Productivity with Walles.AI: A Comprehensive Guide to Efficient Task Management

3
Comments
3 min read
Exploring the World of LLMs for SRE Powered by PartyRock (Claude, Jurassic-2, Titan, Command, Llama 2 & Stable Diffusion XL)

6
Comments
7 min read
Deploy Your Own AI Chat Buddy - The Qwen Chat Model Deployment with HuggingFace Guide

1
Comments 1
9 min read
Async AI Workflows with Graph Theory

6
Comments
2 min read
Are LLM's essentially Teenagers?

12
Comments 1
6 min read
Understanding RAG: A Deeper Dive into the Fusion of Retrieval and Generation

56
Comments 7
4 min read