20+ Common Libraries/Packages Needed to Build LLM Apps


We all use a common set of libraries and packages to build AI/LLM applications, but few sources explain exactly what each one is for.

Pypdf –
A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files.

Tiktoken –
A fast BPE (Byte Pair Encoding) tokeniser for use with OpenAI’s models.

Transformers –
Transformers provides thousands of pre-trained models for tasks across different modalities such as text, vision, and audio.

Bitsandbytes –
Makes large language models more accessible through k-bit quantization for PyTorch.

Accelerate –
It is a library that enables the same PyTorch code to be run across any distributed configuration by adding just four lines of code! In short, training and inference at scale are simple, efficient, and adaptable.

DuckDuckGo Search –
Search for words, documents, images, etc. using the DuckDuckGo.com search engine.

LangChain –
Contains the higher-level, use-case-specific LangChain components that form the core of an application’s architecture.

Langchain_community –
It contains all the third-party integrations. These integrations are ready to use in any LangChain application.

Langchain_core –
Contains the simple, core abstractions that have emerged as a standard, as well as LangChain Expression Language (LCEL) as a way to compose these components together.

langchain_openai –
This package contains the LangChain integrations for OpenAI.

Faiss-cpu –
Efficient similarity search and clustering of dense vectors (CPU build).

Faiss-gpu –
Efficient similarity search and clustering of dense vectors (GPU build).

Streamlit –
Streamlit lets you transform Python scripts into interactive web apps in minutes.

Google-generativeai –
The Google AI Python SDK is the easiest way for Python developers to build with the Gemini API.

Python-dotenv –
Read key-value pairs from a .env file and set them as environment variables

Pdf2image –
A wrapper around the command line tools to convert PDF to a PIL Image list.

Chromadb –
The Python client for Chroma, the open-source embedding database.

Pathlib –
Part of Python’s standard library, so no installation is needed. It offers a set of classes to handle filesystem paths.
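A minimal sketch of everyday path handling with `pathlib`, using only the standard library. The folder and file names are made up for the example.

```python
import tempfile
from pathlib import Path

with tempfile.TemporaryDirectory() as tmp:
    # Paths are joined with the / operator instead of string concatenation.
    docs_dir = Path(tmp) / "docs"
    docs_dir.mkdir()

    (docs_dir / "notes.txt").write_text("hello")
    (docs_dir / "report.pdf").write_bytes(b"%PDF-1.4")

    # Find every PDF in the folder, a common first step in document apps.
    pdf_names = sorted(path.name for path in docs_dir.glob("*.pdf"))

    # Path objects expose the parts of a filename as attributes.
    report = docs_dir / "report.pdf"
    stem, suffix = report.stem, report.suffix
```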

Youtube_transcript_api –
To get the transcripts/subtitles for a given YouTube video. It also works for automatically generated subtitles and supports translating subtitles.

gTTS –
(Google Text-to-Speech), a Python library and CLI tool to interface with Google Translate’s text-to-speech API.

Which one have you used most?

Tailored AI + LLM Coaching for Senior IT Professionals

If you are looking to learn AI and Gen AI in an instructor-led, live class environment, check out these dedicated courses for senior IT professionals.

Pricing for AI courses for senior IT professionals – https://www.aimletc.com/ai-ml-etc-course-offerings-pricing/

My name is Nikhilesh, and if you have any feedback or suggestions on this article, please feel free to connect with me – https://www.linkedin.com/in/nikhileshtayal/

Nikhilesh Tayal