Tuesday, April 28, 2026

10 Python Libraries for Constructing LLM Functions



Picture by Writer

 

Introduction

 
Constructing giant language mannequin (LLM) functions could be very completely different from utilizing consumer-facing instruments like Claude Code, ChatGPT, or Codex. These merchandise are nice for finish customers, however whenever you wish to construct your personal LLM system, you want much more management over how every part works behind the scenes.

That normally means working with libraries and frameworks that enable you to load open-source fashions, construct retrieval-augmented technology (RAG) pipelines, serve fashions by APIs, fine-tune them by yourself knowledge, create agent-based workflows, and consider how effectively every part performs. The problem is that LLM software growth is not only about prompting a mannequin. There are quite a lot of shifting components, and placing them collectively into one thing dependable can get difficult quick.

On this article, we are going to take a look at 10 Python libraries that make that course of simpler. Whether or not you’re experimenting with native fashions, constructing production-ready pipelines, or testing multi-agent techniques, these libraries can assist you progress sooner and construct with extra confidence.

 

1. Transformers

 
Transformers is the library that sits on the middle of quite a lot of open-source LLM work. If you wish to load a mannequin, tokenize textual content correctly, run it for technology, or fine-tune it by yourself knowledge, that is normally the place you begin.

Fashions like GLM, Minimax, and Qwen are generally used by Transformers, and quite a lot of different instruments within the LLM stack are designed to work effectively with it.

What makes it particularly helpful is that it saves you from having to deal with all of the low-level mannequin setup your self. As an alternative of constructing every part from scratch, you need to use a constant interface throughout many various fashions and duties, which makes experimenting, testing, and shifting into manufacturing a lot simpler.

 

2. LangChain

 
LangChain is helpful if you find yourself now not simply sending one immediate to at least one mannequin and calling it a day. It helps you join the items that actual LLM apps normally want — like prompts, retrievers, instruments, APIs, and mannequin calls — into one circulation, which is why it’s generally used for issues like chatbots, RAG techniques, and agent-style functions.

What makes it sensible is that it provides construction to a messy stack. As an alternative of wiring each step your self, you need to use it to handle multi-step logic, join outdoors techniques, and construct functions that do greater than generate textual content, which is an enormous cause it turned one of many best-known frameworks on this house.

 

3. LlamaIndex

 
If LangChain helps you join the shifting components of an LLM app, LlamaIndex helps you join that app to the info it really wants. It’s particularly helpful for RAG, the place the mannequin wants to tug in info from paperwork, PDFs, databases, or different information sources earlier than answering.

That issues as a result of most helpful LLM functions can not depend on mannequin reminiscence alone. By grounding responses in actual knowledge, LlamaIndex helps make solutions extra related, extra updated, and way more sensible for issues like inner assistants, information bases, and document-heavy workflows.

 

4. vLLM

 
vLLM is likely one of the hottest libraries for serving open-source LLMs effectively. It’s constructed for quick inference, higher GPU reminiscence use, and high-throughput technology, which makes it a robust selection whenever you wish to run fashions in a method that feels sensible quite than experimental.

What makes it vital is that serving a mannequin effectively is an enormous a part of constructing an actual LLM software. vLLM helps make open fashions simpler to deploy at scale, deal with extra requests, and generate responses sooner, which is why so many groups use it when shifting from testing to manufacturing.

 

5. Unsloth

 
Unsloth has change into a preferred selection for fine-tuning as a result of it makes the method way more accessible for smaller groups and particular person builders. It’s particularly identified for environment friendly low-rank adaptation (LoRA) and quantized LoRA (QLoRA) workflows, the place the aim is to coach or adapt a mannequin sooner whereas utilizing much less VRAM than heavier fine-tuning setups.

What makes it vital is that it lowers the price of really customizing highly effective fashions. As an alternative of needing large {hardware} simply to get began, builders can fine-tune fashions in a extra sensible method on restricted assets, which is an enormous cause Unsloth has change into such a typical decide for resource-efficient coaching.

 

6. CrewAI

 
CrewAI is a well-liked framework for constructing multi-agent functions the place completely different brokers tackle completely different roles, targets, and duties. As an alternative of counting on one mannequin name to do every part, it provides you a approach to arrange a small staff of brokers that may collaborate, use instruments, and work by structured workflows collectively.

What makes it helpful is that extra LLM apps are beginning to look much less like easy chatbots and extra like coordinated techniques. CrewAI helps builders construct these agent-based workflows in a cleaner method, particularly when a process advantages from planning, delegation, or splitting work throughout specialist brokers.

 

7. AutoGPT

 
AutoGPT continues to be one of many best-known names within the agent world as a result of it helped introduce lots of people to the concept of AI techniques that may plan duties, break targets into steps, and take actions with much less back-and-forth from the person. It turned well known as an early instance of what autonomous agent workflows might appear to be, which is why it nonetheless comes up so typically in conversations about agent growth.

A key function it offers is assist for goal-driven, multi-step process execution. In follow, which means you need to use it to construct brokers that plan, handle steps throughout a workflow, and automate longer-running duties in a extra structured method than a easy chat interface.

 

8. LangGraph

 
LangGraph is constructed for builders who want extra management over how an LLM software runs. As an alternative of utilizing a easy linear chain, it allows you to design stateful workflows with branching paths, reminiscence, and multi-step logic, which makes it a robust match for extra superior agent techniques and long-running duties.

What makes it helpful is the additional construction it provides you. You possibly can outline how execution ought to transfer from one step to a different, hold observe of state throughout the workflow, and construct techniques which might be simpler to handle when the logic will get extra advanced than a primary immediate pipeline.

 

9. DeepEval

 
DeepEval is a Python framework constructed for testing and evaluating LLM functions. As an alternative of simply checking whether or not a mannequin provides a solution, it helps you measure issues like reply relevance, hallucination, faithfulness, and process success, which makes it helpful as soon as your app begins turning into one thing folks really depend on.

What makes it vital is that constructing an LLM app is not only about technology — it’s also about figuring out whether or not the system is working effectively. DeepEval provides builders a extra structured approach to take a look at prompts, RAG pipelines, and agent workflows, which is an enormous a part of making an software extra dependable earlier than and after it reaches manufacturing.

 

10. OpenAI Python SDK

 
The OpenAI Python SDK is likely one of the best methods so as to add LLM options to an software with out having to handle your personal mannequin internet hosting. It provides Python builders a easy interface for working with hosted OpenAI fashions, so you’ll be able to construct issues like chat options, reasoning workflows, image-aware apps, and different multimodal experiences a lot sooner.

What makes it so helpful is velocity and ease. As an alternative of worrying about serving fashions, scaling inference, or dealing with the low-level infrastructure your self, you’ll be able to concentrate on constructing the precise product logic, which is an enormous cause the SDK stays such a typical selection for API-based LLM functions.

 

Evaluating the ten Libraries

 
Here’s a fast side-by-side view of what every library is principally used for.

 

Library Finest For Why It Issues
Transformers Mannequin loading and fine-tuning Types the muse of a lot of the open LLM ecosystem
LangChain LLM app workflows Connects prompts, instruments, retrieval, and APIs into one circulation
LlamaIndex RAG and knowledge-based apps Helps floor responses in actual knowledge
vLLM Quick inference and serving Makes open fashions simpler to deploy effectively
Unsloth Environment friendly fine-tuning Lowers the price of adapting highly effective fashions
CrewAI Multi-agent techniques Helps construction agent roles and workflows
AutoGPT Autonomous agent experiments Helps goal-driven, multi-step process execution
LangGraph Stateful agent orchestration Provides extra management for advanced workflows
DeepEval Analysis and testing Helps measure reliability earlier than manufacturing
OpenAI Python SDK API-based LLM apps One of many quickest methods to ship LLM options

 
 

Abid Ali Awan (@1abidaliawan) is an authorized knowledge scientist skilled who loves constructing machine studying fashions. At the moment, he’s specializing in content material creation and writing technical blogs on machine studying and knowledge science applied sciences. Abid holds a Grasp’s diploma in know-how administration and a bachelor’s diploma in telecommunication engineering. His imaginative and prescient is to construct an AI product utilizing a graph neural community for college students battling psychological sickness.

Related Articles

Latest Articles