Hugging Face

Gemma 2: Elevating Open Language Models with Practical Efficiency

Jun 27, 2024

AI, Generative AI, LLM, Open Source

Gemma, LLM, Google, Hugging Face, Open Source, Gemma 2

Here I am again with another language model… This time an open source one: Gemma 2, from Google! The world of artificial intelligence has witnessed an explosion in the capabilities of Large Language Models (LLMs)… These complex systems, trained on massive datasets, have demonstrated remarkable proficiency in understanding and generating human-like text, pushing the boundaries…
Uncensoring LLMs: Exploring the Power of Abliteration

Jun 16, 2024

AI, Generative AI, LLM

Abliteration, Censorship, Hugging Face, LLM, Uncensored LLM, Weight Orthogonalization

Today let’s talk about the limitations imposed by the built-in safety mechanisms of LLMs: while these safeguards are essential, they often feel like a straightjacket, preventing us from fully exploring the potential of these models. Recently, I stumbled across articles (a paper and an article on Hugging Face) about the concept of “abliteration”, a technique…
NVIDIA Released Nemotron-4 340B: Used in a Synthetic Data Generation Pipeline

Jun 15, 2024

AI, Generative AI, LLM, Open Source

Hugging Face, LLM, Nemotron, Nemotron-4-340B, NIVIDIA Open Model License, NVIDIA, Open Source, SDG Pipeline, Self-Reinforcement, Synthetic Data Generation

Let’s face it, when it comes to pushing the frontiers of AI, NVIDIA is right now in a league of its own… And the latest news from NVIDIA is? Nemotron-4 340B: a whole new family of language models where each model is part of a synthetic data generation pipeline able to generate high quality dataset…
Talking About Stable Diffusion 3: An Open-Source Image Generation

Jun 13, 2024

AI, Generative AI, Image Generation, Open Source

CLIP, Hugging Face, MMDiT, Multimodal Diffusion Transformer, Open Source, Rectified Flow, SD3, SD3 Medium, Stability AI, Stable Diffusion, T5

Stable Diffusion, spearheaded by Stability AI, has become synonymous with accessible and powerful AI image generation. With the release of Stable Diffusion 3 (SD3) the bar for open-source text-to-image generation has been significantly raised. Also, yesterday the weights of Stable Diffusion 3 Medium have been released on Hugging Face, so… It’s time for an article!…
Qwen2: Alibaba’s Open-Source LLM Evolves with Enhanced Capabilities and Multilingual Prowess

Jun 7, 2024

AI, Generative AI, LLM, Open Source

Alibaba, Apache 2.0, Github, Group Query Attention, Hugging Face, LLM, multilingual, Open Source, Qianwen License, Qwen2

Alibaba makes another impactful contribution to the open-source LLM landscape with the release of Qwen2, a substantial upgrade to its predecessor, Qwen1.5. Qwen2 arrives with an array of model sizes, expanded language support, and impressive performance enhancements, positioning it as a versatile tool for diverse AI applications. However, if you want more details go see the…
V-Express: Making Portraits Speak with Expressive Audio-Driven Video Generation

Jun 6, 2024

AI, Audio-Driven Animation, Generative AI, Human Image Animation, Open Source, Video Generation

AVSpeech, Diffusion Model, Github, Hugging Face, Latent Diffusion, Open Source, Stable Diffusion, Talking Face Generation, Talking-Head Video, TalkingHead-1KH, V-Express

The world of digital content creation is constantly evolving, with a growing demand for personalized and engaging experiences. One area experiencing a surge in popularity is portrait video generation, particularly the ability to generate realistic talking-head videos from a single image. It’s able for example to animate a family photo with a heartfelt message or…
ChatTTS: An Amazing Open Source TTS Model

May 31, 2024

AI, Generative AI, Open Source, Text to Speech

Attribution-NonCommercial 4.0 International license, ChatTTS, Github, Hugging Face, Open Source, Text to Speech, TTS

The world of text-to-speech (TTS) systems has come a long way. We’ve moved from robotic, flat voices to ones that sound surprisingly human… ChatTTS is a newcomer in this space, and it aims to change the way we interact with computers through natural-sounding speech. Not only, ChatTTS is released under the Attribution-NonCommercial 4.0 International license,…
Codestral: Code Generation Model by Mistral AI

May 31, 2024

AI, Code Language Model, Generative AI, LLM, Open Source

Code Completion, Code Generation, Codestral, Continue.dev, Fill-in-the-Middle (FIM), Hugging Face, La Plateforme, LangChain, Le Chat, LlamaIndex, Mistral AI, Mistral AI Non-Production License, Tabnine

Another model! Yes… This time we have a new Mistral AI’s release: Codestral! This open-weight, 22 billion parameter model is specifically designed to excel in code generation tasks across a vast range of programming languages… But, let’s stop the introduction here. For more details: Multilingual Mastery: Codestral’s Extensive Language Support Codestral boasts an impressive command…
Grounding DINO 1.5: Pushing the Boundaries of Open-Set Object Detection

May 22, 2024

AI, Object Detection, Open Source

Dual Encoder, Edge Device, Github, Grounding DINO 1.5, Grounding-20M Dataset, GroundingDINO, Hugging Face, Image Encoder, Object Detection, Real-time, Single Decoder, Text Encoder, Zero-shot, Zero-shot transfer

Imagine a world where computer vision breaks free from the limitations of rigid, pre-defined object categories, with picture detectors empowered by the richness of language, capable of recognizing objects they’ve never encountered before… This is the world that Grounding DINO 1.5 brings to life, a world where object detection transcends the visual and embraces the…
Hunyuan-DiT: Unlocking the Power of Chinese Text-to-Image Generation

May 18, 2024

AI, Generative AI, Image Generation, Open Source

CLIP, Diffusion Transformer, DiT, Github, Hugging Face, Hunyuan-DiT, multi-turn dialogue, Open Source, RoPE, T5, Tencent, VAE

Imagine typing a few lines of text, perhaps a verse from a Tang Dynasty poem or a description of a bustling Hong Kong street market, and watching as a stunningly realistic image materializes on your screen. This is the power of Hunyuan-DiT, a cutting-edge AI model developed by Tencent that excels in generating images from…