The Hugging Face Hub is the central place to discover and share ML models, datasets, and Spaces.
Start by installing the huggingface_hub library:
pip install --upgrade huggingface_hub
For optional features, install extra dependencies like:
pip install 'huggingface_hub[tensorflow]'
Use hf_hub_download() to download a specific file from a repo:
from huggingface_hub import hf_hub_download
hf_hub_download(repo_id="google/pegasus-xsum", filename="config.json")
For an entire repo, use snapshot_download():
from huggingface_hub import snapshot_download
snapshot_download(repo_id="lysandre/arxiv-nlp")
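snapshot_download() also accepts allow_patterns/ignore_patterns to fetch only part of a repo; a minimal sketch (the pattern choice is illustrative):
from huggingface_hub import snapshot_download

# Download only the JSON files from the repo.
snapshot_download(repo_id="lysandre/arxiv-nlp", allow_patterns=["*.json"])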
You'll need a Hugging Face account and a User Access Token for many actions.
Log in from the terminal:
huggingface-cli login
Alternatively, set the HF_TOKEN environment variable.
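You can also authenticate programmatically with login() (its full signature appears in the reference below); the token value here is a placeholder:
from huggingface_hub import login

# Authenticate this session with a User Access Token (placeholder value).
login(token="hf_xxx")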
Use create_repo() to create a new repository:
from huggingface_hub import HfApi
api = HfApi()
api.create_repo(repo_id="my-new-model")
Set private=True for a private repo.
Use upload_file() for a single file:
api.upload_file(
    path_or_fileobj="/path/to/README.md",
    path_in_repo="README.md",
    repo_id="my-username/my-repo",
)
For a folder, use upload_folder().
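A minimal upload_folder() sketch (the local path and repo name are placeholders):
from huggingface_hub import HfApi

api = HfApi()
api.upload_folder(
    folder_path="/path/to/local/folder",  # local directory to upload (placeholder)
    repo_id="my-username/my-repo",        # target repo (placeholder)
)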
Run accelerated inference on Hugging Face servers with InferenceClient:
from huggingface_hub import InferenceClient
client = InferenceClient()
# Text to Image
image = client.text_to_image("A cat wearing a hat")
image.save("cat_with_hat.png")
# Chat Completion
messages = [{"role": "user", "content": "Translate 'Hello' to Spanish."}]
response = client.chat_completion(messages, model="meta-llama/Meta-Llama-3-8B-Instruct")
print(response.choices[0].message.content)  # ¡Hola!
Use HfApi to manage your repositories and to interact with the community (discussions, likes, pull requests).
Organize models, datasets, Spaces, and papers with collections.
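A sketch of the collections API, assuming a recent huggingface_hub release that provides create_collection() and add_collection_item() (the title and item are placeholders):
from huggingface_hub import HfApi

api = HfApi()
# Create a collection and add a model to it.
collection = api.create_collection(title="My favorite text models")
api.add_collection_item(
    collection_slug=collection.slug,
    item_id="google/pegasus-xsum",
    item_type="model",
)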
The huggingface_hub library caches downloaded files to speed up future access.
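To inspect that cache, scan_cache_dir() returns a report of what is stored locally; a minimal sketch:
from huggingface_hub import scan_cache_dir

# Summarize the local Hub cache: total size and the cached repos.
cache_info = scan_cache_dir()
print(f"Cache size on disk: {cache_info.size_on_disk} bytes")
for repo in cache_info.repos:
    print(repo.repo_id, repo.repo_type)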
Create informative Model Cards to describe your models.
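A minimal sketch using the ModelCard class (the card content and repo name are placeholders):
from huggingface_hub import ModelCard

# Build a card from raw Markdown (YAML metadata block + body) and push it.
card = ModelCard("""---
license: mit
---
# My new model
A short description of what this model does.
""")
card.push_to_hub("my-username/my-new-model")  # placeholder repo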
huggingface_hub.login(token, add_to_git_credential=False, new_session=True, write_permission=False)
huggingface_hub.interpreter_login(new_session=True, write_permission=False)
huggingface_hub.notebook_login(new_session=True, write_permission=False)
huggingface_hub.logout()
HF_HOME: base directory for Hugging Face configuration and cache.
HF_HUB_CACHE: where files downloaded from the Hub are cached.
HF_ASSETS_CACHE: cache for assets created by downstream libraries.
HF_TOKEN: authentication token, used instead of an interactive login.
HF_HUB_OFFLINE: set to 1 to disable all network calls.
HF_HUB_DISABLE_PROGRESS_BARS: set to 1 to hide progress bars.
HF_HUB_DISABLE_TELEMETRY: set to 1 to disable telemetry.
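These can also be set from Python before importing huggingface_hub; the values below are illustrative:
import os

# Point the Hugging Face home directory at a custom location and force offline mode.
os.environ["HF_HOME"] = "/tmp/hf_home"
os.environ["HF_HUB_OFFLINE"] = "1"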
HfApi client: huggingface_hub.HfApi(endpoint=None, token=None, ...)
api.create_repo(repo_id, token=None, private=False, repo_type=None, exist_ok=False, ...)
api.delete_repo(repo_id, token=None, repo_type=None, missing_ok=False)
api.upload_file(path_or_fileobj, path_in_repo, repo_id, token=None, repo_type=None, revision=None, ...)
api.upload_folder(folder_path, repo_id, path_in_repo=None, token=None, repo_type=None, revision=None, ...)
api.hf_hub_download(repo_id, filename, revision=None, cache_dir=None, force_download=False, ...)
api.list_models(filter=None, author=None, search=None, sort=None, direction=None, limit=None, ...)
api.list_datasets(filter=None, author=None, search=None, sort=None, direction=None, limit=None, ...)
api.model_info(repo_id, revision=None, timeout=None, securityStatus=None, files_metadata=False, ...)
api.dataset_info(repo_id, revision=None, timeout=None, files_metadata=False, ...)
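A quick sketch of the listing calls (the filter value is illustrative):
from huggingface_hub import HfApi

api = HfApi()
# List five models tagged for text classification and print their IDs.
for model in api.list_models(filter="text-classification", limit=5):
    print(model.id)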
The InferenceClient object connects to inference services.
model: Model ID (e.g., "meta-llama/Meta-Llama-3-8B-Instruct") or Inference Endpoint URL.
token: Hugging Face authentication token.
timeout: Maximum wait time for server response.
AsyncInferenceClient: an asynchronous version using asyncio and aiohttp.
pip install --upgrade 'huggingface_hub[inference]'
With AsyncInferenceClient, each method is called with await.
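A minimal async sketch (the prompt is illustrative):
import asyncio
from huggingface_hub import AsyncInferenceClient

async def main():
    client = AsyncInferenceClient()
    # Each call is a coroutine and must be awaited.
    result = await client.text_generation("Once upon a time, ", max_new_tokens=20)
    print(result)

asyncio.run(main())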
audio_classification(audio, model): Classifies audio content into predefined categories.
  audio: The audio content to classify (file path, bytes, URL).
  model: The audio classification model to use.
automatic_speech_recognition(audio, model): Transcribes audio to text (ASR).
  audio: The audio content to transcribe (file path, bytes, URL).
  model: The ASR model to use.
chat_completion(messages, model, ...): Generates conversational responses in a chat-like format.
  messages: A list of chat messages with roles (user, assistant, system).
  model: The conversational model to use.
  Also accepts max_tokens, temperature, etc.
feature_extraction(text, model): Generates numerical representations (embeddings) of text.
  text: The text to embed.
  model: The text embedding model to use.
image_classification(image, model): Classifies images into predefined categories.
  image: The image to classify (file path, bytes, URL).
  model: The image classification model to use.
text_generation(prompt, model, ...): Generates text based on a given prompt.
  prompt: The starting text for the generation.
  model: The text generation model to use.
  Also accepts max_new_tokens, temperature, etc.
text_to_image(prompt, model, ...): Generates images from text descriptions.
  prompt: The text description of the image to generate.
  model: The text-to-image model to use.
  Also accepts height, width, etc.
translation(text, model, src_lang, tgt_lang): Translates text from one language to another.
  text: The text to translate.
  model: The translation model to use.
  src_lang: The source language (optional).
  tgt_lang: The target language (optional).
from huggingface_hub import InferenceClient
# Initialize the InferenceClient
client = InferenceClient(token="YOUR_HUGGING_FACE_TOKEN")
# Generate an image
image = client.text_to_image(
prompt="A cat wearing a top hat riding a unicycle on a tightrope",
model="stabilityai/stable-diffusion-2-1", # Specify the text-to-image model
height=512, # Optional: Set image height
width=512, # Optional: Set image width
)
# Save the image
image.save("cat_unicycle.png")
from huggingface_hub import InferenceClient
# Initialize the InferenceClient
client = InferenceClient(token="YOUR_HUGGING_FACE_TOKEN")
# Generate text
generated_text = client.text_generation(
prompt="Once upon a time, in a land far away, ",
model="gpt2", # Specify the text generation model
max_new_tokens=50, # Limit the length of generated text
temperature=0.7, # Control the randomness (higher = more random)
)
# Print the generated text
print(generated_text)
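A matching chat_completion sketch (the model choice and prompt are illustrative):
from huggingface_hub import InferenceClient

# Initialize the InferenceClient
client = InferenceClient(token="YOUR_HUGGING_FACE_TOKEN")

# Ask a conversational model a question
messages = [{"role": "user", "content": "Name three uses of the Hugging Face Hub."}]
response = client.chat_completion(
    messages,
    model="meta-llama/Meta-Llama-3-8B-Instruct",  # illustrative chat model
    max_tokens=100,
)

# Print the assistant's reply
print(response.choices[0].message.content)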
Tokenizers: use Tokenizer.encode to turn text into token IDs.
pip install tokenizers
Or build from source: git clone the repository and compile.
pip install transformers datasets evaluate accelerate
pipeline(task="task-name") is the general form (usage sketch after this list):
pipeline(task="sentiment-analysis")
pipeline(task="text-generation")
pipeline(task="summarization")
pipeline(task="image-classification")
pipeline(task="image-segmentation")
pipeline(task="object-detection")
pipeline(task="audio-classification")
pipeline(task="automatic-speech-recognition")
pipeline(task="vqa")
pip install transformers
pip install 'transformers[torch]'
pip install git+https://github.com/huggingface/transformers