Open Source — AI news — AI News Digest

Sipp - an open-source library for in-browser inference built on llama.cpp

Sipp is an open-source library designed for in-browser inference, built on the llama.cpp framework. This library allows developers to leverage LLaMA models directly in the browser, facilitating real-time AI applications without server dependency. Its release is significant for practitioners looking to implement efficient, client-side AI solutions while maintaining model accessibility and performance.

Reddit r/LocalLLaMA32 d agofound 12 d ago#llama#inference#open_source

Decentralised AI Training and Inference with BlockTrain

The article introduces Spheroid BlockTrain, a decentralized AI training protocol that partitions models into independently trainable blocks, optimizing local objectives derived from a global target. BlockTrain achieves a cross-entropy of 1.359 on byte-level WikiText, closely matching a full Transformer reference, while allowing each worker to focus on a single block, thus minimizing resource requirements. This approach enhances scalability and efficiency for AI practitioners by enabling decentralized training and inference across multiple hosts, significantly improving performance metrics like perplexity and reducing the need for centralized infrastructure.

arXiv cs.AI33 d agofound 10 d ago#decentralized training#blockchain#ai systems

[Open Source] I am releasing my HugginFace downloader App

The HF Downloader is an open-source application designed for efficient downloading from Hugging Face repositories and other URLs, available for MacOS, Windows, and Linux (Arm & Amd64). Key features include automatic pause and resume capabilities during download interruptions, the ability to filter and preview files before downloading, and management of download jobs with live progress tracking. This tool is particularly beneficial for practitioners facing unreliable internet connections, enabling them to streamline the acquisition of models and datasets without the need for cloud services or accounts.

Reddit r/LocalLLaMA33 d agofound 21 d ago#huggingface#downloader#open_source

Krea 2 released on Hugging Face

Krea 2 has been released on Hugging Face, featuring two variants: Krea-2-Turbo and Krea-2-Raw. While specific model sizes and benchmark results have not been detailed, this release provides practitioners with new options for deploying LLMs in various applications. The availability on Hugging Face enhances accessibility for developers looking to integrate these models into their workflows.

Reddit r/LocalLLaMA33 d agofound 21 d ago#krea#huggingface#release

Clinical Term Extraction using Open-Source Small Language Models

The study evaluates the performance of 26 open-source small language models (SLMs) for extracting clinical terms relevant to amyotrophic lateral sclerosis (ALS) from unstructured clinical notes, using few-shot prompting without task-specific training data. The Qwen3-4B-Instruct-2507 model achieved the highest micro-F1 score, while a regex baseline outperformed all SLMs in overall micro-F1 and Hamming loss. The results indicate that hybrid extraction workflows combining SLMs with traditional rule-based methods may be more effective than relying solely on SLMs for clinical term extraction.

arXiv cs.AI34 d agofound 16 d ago#clinical-terms#language-models#extraction

AOHP: An Open-Source OS-Level Agent Harness for Personalized, Efficient and Secure Interaction

The AOHP (Android Open Harness Project) has been released as an open-source OS-level agent harness built on the Android Open Source Project (AOSP), designed to support AI agents as first-class OS actors. Key features include personalized service composition, efficient agent interfaces, and secure information flow, resulting in a 21.12% increase in task completion rate and a 51.55% reduction in execution costs compared to traditional systems. This framework addresses the limitations of existing operating systems for AI agents, facilitating more efficient and secure interactions, which is crucial for practitioners developing agent-based applications.

arXiv cs.AI34 d agofound 20 d ago#agent#os

CuratorKIT : Data Curation and Synthetic Data Generation for LLM Post-Training

CuratorKIT is an open-source Python library designed for data curation and synthetic data generation in post-training workflows for large language models. It integrates ingestion, deduplication, synthetic generation, and quality filtering into a single configurable pipeline, featuring LLM-powered generation tasks, provenance tracking, and structured failure reporting. This tool supports over 100 LLM providers and offers both a Python API and a YAML-driven CLI, making it essential for practitioners seeking reproducible and auditable data management at scale.

arXiv cs.CL34 d agofound 13 d ago#data curation#llm#pipeline

OpenAI launches new initiative to help find and patch open source bugs

OpenAI has launched an initiative aimed at utilizing AI to assist the open source community in identifying and addressing security vulnerabilities in software. This effort focuses on enhancing the robustness of open source projects by providing tools for automated bug detection and patching, which is critical for maintaining software integrity and security. This initiative is significant for practitioners as it promotes safer development practices and potentially reduces the risk of exploitation in widely used open source libraries.

TechCrunch AI34 d agofound 21 d ago#openai#open source#bugs

Why is NO one talking about Microsoft's open source Fast Context!!!

Microsoft has released FastContext-1.0, an open-source repository-exploration subagent designed for LLM coding tasks. This model separates the roles of repository exploration and task solving, utilizing parallel read-only tool calls to provide compact file paths and line ranges, leading to significant improvements in accuracy across benchmarks, including a 60.3% reduction in token usage for GPT-5.4 on SWE-QA. Practitioners can leverage FastContext to enhance the efficiency and performance of coding agents, potentially achieving better results with smaller models compared to larger alternatives.

Reddit r/LocalLLaMA34 d agofound 21 d ago#microsoft#fastcontext#open_source

Add Laguna M.1 GGUF support by empty-quiver · Pull Request #2003 · ikawrakow/ik_llama.cpp

Support for the Laguna M.1 model, with 225 billion parameters, has been added to the ik_llama.cpp repository, enabling practitioners to leverage this GGUF format model. This follows prior support for the Laguna XS.2 model, which has 33 billion parameters. The addition enhances the capabilities of the ik_llama.cpp framework, allowing AI engineers to experiment with larger models and optimize their applications using these advanced architectures.

Reddit r/LocalLLaMA34 d agofound 21 d ago#gguf#pull_request#laguna

Patch the Planet: a Daybreak initiative to support open source maintainers

OpenAI has launched "Patch the Planet," a Daybreak initiative aimed at assisting open-source maintainers in identifying, validating, and addressing vulnerabilities using AI and expert reviews. This initiative highlights the integration of AI in enhancing software security and could significantly improve the reliability of open-source projects, which are crucial for AI development and deployment.

OpenAI News35 d agofound 21 d ago#openai#patch#open_source

Support Step3.5/3.7 flash mtp3 by forforever73 · Pull Request #24340 · ggml-org/llama.cpp

A pull request (#24340) has been submitted to the ggml-org/llama.cpp repository, introducing multi-layer MTP (Memory Transfer Protocol) support for models Step3.5 and Step3.7. This enhancement builds on previous work (#23274) and aims to improve memory management and efficiency in LLaMA implementations. Practitioners can leverage this update to optimize the performance of their applications using LLaMA models with enhanced memory capabilities.

Reddit r/LocalLLaMA35 d agofound 21 d ago#llama#pull_request#gguf

Qwen 3.6 27b Abliterated (apostate)

The Qwen 3.6 27B model has been released on Hugging Face, featuring a significant reduction in safety alignment, with a refusal rate decreased from 92% to 7.6% while maintaining minimal impact on model performance (0.120 KL). This release is notable for practitioners as it allows for greater flexibility in model responses, potentially enhancing usability in applications that require fewer content restrictions.

Reddit r/LocalLLaMA35 d agofound 21 d ago#qwen#huggingface

I forked ik_llama.cpp and added a "--numa mirror" mode to maximize performance on multi-socket CPU systems. Just sharing and looking for testers!

A fork of the ik_llama.cpp repository has been created, introducing a `--numa mirror` mode aimed at enhancing inference performance on multi-socket CPU systems by duplicating model weights and KV cache across CPU sockets. This approach mitigates performance penalties associated with remote memory accesses by ensuring each CPU socket has local access to the necessary data, thereby utilizing all CPU cores effectively. While this method requires double the RAM for dual-socket systems, initial benchmarks indicate significant performance gains, highlighting its potential for improving LLM inference on multi-socket architectures.

Reddit r/LocalLLaMA35 d agofound 21 d ago#llama#performance

Qwen is never going to open source Qwen 3.7, aren't they?

Qwen has announced that it will not open source its Qwen 3.7 model, following the departure of Junyang Lin, who was involved in the development of earlier models. The shift marks a significant change as Qwen has now fully locked down its larger models, making it the last major Chinese AI lab not to release an open source model recently, which could impact the competitive landscape for practitioners seeking accessible alternatives like those from GLM and other labs.

Reddit r/LocalLLaMA36 d agofound 21 d ago#qwen#open_source

z.AI as the number 2 gives praise to the number 1 open source model

The article discusses z.AI, which has been recognized as the second-best open-source model, commending the top-ranked model. While specific technical details such as model size, benchmark results, or architecture changes are not provided, the acknowledgment of the leading model highlights its significance in the open-source AI landscape. This recognition is relevant for practitioners as it may influence model selection and development strategies in their projects.

Reddit r/LocalLLaMA36 d agofound 22 d ago#open source#model

Commission selects EUROPA consortium as the winner of the Frontier AI Grande Challenge, a project to build European open-source frontier AI model in all 24 EU languages

The European Commission has awarded the Frontier AI Grand Challenge to the EUROPA consortium, tasked with developing an open-source AI model that supports all 24 official EU languages and exceeds 400 billion parameters. This initiative aims to enhance Europe's AI capabilities and ensure accessibility for businesses and institutions across its linguistic diversity. The project signifies a strategic move towards AI sovereignty in Europe, aligning with the region's values of openness and trust.

Reddit r/LocalLLaMA37 d agofound 24 d ago#europa#open-source#ai model

The economics of AI are starting to favor open models

Recent trends indicate that open-weight models like DeepSeek, Qwen, GLM, Kimi, and MiniMax are increasingly competitive against closed API alternatives, providing high intelligence at lower costs. This shift suggests that businesses may prioritize cost-effectiveness and control over marginal performance improvements, as the gap in performance between frontier models and strong open models narrows. For practitioners, this could mean a reevaluation of model selection criteria, emphasizing customization and privacy offered by open-source solutions.

Reddit r/LocalLLaMA37 d agofound 24 d ago#open models#economics#ai

GLM-5.2 can now run locally in llama.cpp and Unsloth Studio.

GLM-5.2 has been released for local execution using llama.cpp and Unsloth Studio, featuring a 2-bit model that achieves approximately 82% accuracy while reducing its size from 1.51TB to 238GB, a decrease of 84%. This model is optimized for high-performance setups, requiring 256GB of RAM or VRAM, and is positioned as the leading open model available, making it significant for practitioners looking to deploy efficient and powerful LLMs in local environments.

Reddit r/LocalLLaMA38 d agofound 24 d ago#glm#local#model

Telenor Nordics Customer Service self-help corpus

A multilingual customer service self-help corpus has been released, consisting of 1,122 validated documents in Finnish, Danish, Norwegian, and Swedish, with a total of 274,599 words. This dataset addresses the scarcity of domain-specific resources for Nordic languages, particularly in customer service, and is designed to enhance retrieval-augmented generation, cross-lingual transfer learning, and agent-based service architectures. The corpus is publicly available, promoting reproducible research in Nordic NLP and information retrieval.

arXiv cs.CL38 d agofound 22 d ago#customer-service#self-help#multilingual

Automated Standardization of Legacy Biomedical Metadata Using an Ontology-Constrained LLM Agent

The article presents a novel LLM-based system for standardizing legacy biomedical metadata by integrating real-time queries to standard reporting guidelines and authoritative terminology services. The evaluation on 839 records from the Human BioMolecular Atlas Program demonstrates that this approach significantly enhances prediction accuracy compared to traditional LLMs, highlighting its potential for improving dataset findability and interoperability in biomedical research. This method addresses the challenge of producing FAIR datasets by enabling dynamic access to machine-actionable metadata standards, which is crucial for practitioners working with diverse biomedical datasets.

arXiv cs.AI38 d agofound 22 d ago#metadata#llm#standardization

Freeing the Law with LOCUS: A Local Ordinance Corpus for the United States

The article announces the release of LOCUS, a Local Ordinance Corpus for the United States, which provides a comprehensive machine-readable dataset of municipal and county ordinance codes from 9,239 jurisdictions. This corpus includes a county-harmonized access layer for the 2,309 largest counties, facilitating bulk research access to local laws that were previously fragmented and difficult to analyze. The release employs OCR technology to standardize diverse document formats and includes trained ModernBERT-based classifiers to enable nuanced legal analysis, supporting advancements in legal AI research and improving access to local regulations for practitioners.

arXiv cs.CL39 d agofound 25 d ago#legal ai#corpus#local ordinances

GLM-5.2 is probably the most powerful text-only open weights LLM

Z.ai has released GLM-5.2, a 753 billion parameter text-only model with a 1 million token context window, significantly larger than its predecessor GLM-5.1, which had a 200,000 token limit. This model utilizes a Mixture of Experts architecture with 40 active parameters and has been benchmarked as the leading open weights model on the Artificial Analysis Intelligence Index, although it is noted to be token-hungry, requiring 43k output tokens per task. Its performance in frontend web development tasks is notable, ranking second on the Code Arena WebDev leaderboard, highlighting its potential for practical applications despite the absence of image input capabilities.

Simon Willison39 d agofound 24 d ago#open-weights#llm#GLM-5.2

PSA: unsloth/GLM-5.2-GGUF is uploading

The repository for the GLM-5.2 model in GGUF format has been created by Unsloth on Hugging Face, with uploads currently in progress. While specific technical details such as model size or benchmark results have not been disclosed yet, the release of GLM-5.2 could provide practitioners with an updated architecture or improved performance metrics for their AI applications. This development is significant as it indicates ongoing advancements in model capabilities within the GGUF framework.

Reddit r/LocalLLaMA39 d agofound 29 d ago#glm-5.2#huggingface

Anthropic becomes first AI startup to join the Frontier carbon removal coalition

Anthropic has joined the Frontier carbon removal coalition, which has secured an additional $915 million in pledges to finance carbon removal initiatives. This collaboration highlights the increasing involvement of AI startups in environmental sustainability efforts, potentially influencing future AI applications in climate technology and carbon management.

TechCrunch AI39 d agofound 29 d ago#anthropic#carbon#coalition

We built an open source UI kit for document RAG/agents

Extend has released an open-source UI kit designed for document retrieval-augmented generation (RAG) and agent applications, featuring 15 customizable components for handling PDF, DOCX, and XLSX formats, including capabilities for bounding box citations and e-signatures. This toolkit addresses the complexities of building scalable document viewers, having been refined through real-world usage at Extend, where it processes millions of pages daily. It provides a valuable resource for developers building document processing agents and user-facing document intake systems, with the potential for continued improvement through community contributions.

Reddit r/LocalLLaMA39 d agofound 29 d ago#rag#document-processing#ui-kit

Mordal: Automated Pretrained Model Selection for Vision Language Models

Mordal is an automated framework for selecting pretrained vision language models (VLMs) tailored to specific tasks, significantly enhancing the efficiency of model selection in multimodal applications. It reduces candidate evaluation time and GPU resource usage, achieving 8.9x to 11.6x fewer GPU hours compared to traditional grid search methods, while also outperforming existing model selection techniques by approximately 69% in weighted Kendall's τ across various tasks. This advancement is crucial for practitioners as it streamlines the process of deploying effective VLMs in real-world applications like healthcare and robotics.

arXiv cs.AI40 d agofound 28 d ago#vision language models#automated model selection

AnalogFed: Privacy-Preserving Discovery of Analog Circuits at Scale with Federated Generative AI

AnalogFed is a novel privacy-preserving framework designed for large-scale analog circuit topology discovery, utilizing federated learning and generative AI. It addresses security challenges such as membership inference and model inversion attacks through techniques like dummy token injection and efficient homomorphic encryption. This framework enables collaborative hardware design while maintaining data confidentiality, paving the way for scalable electronic design automation in the semiconductor industry.

arXiv cs.AI40 d agofound 28 d ago#federated learning#generative ai

LineageMark: Multi-user White-box Watermarking for Contribution Tracing in Model Derivation Chains

LineageMark is a newly proposed multi-user white-box watermarking framework designed for tracking contributions in model derivation chains of large language models (LLMs). It utilizes a projection-based approach to embed watermarks in model parameters, ensuring robustness against changes and perturbations during incremental updates, re-watermarking, fine-tuning, quantization, and pruning. This framework is significant for practitioners as it facilitates reliable provenance tracking and intellectual property protection in collaborative LLM development environments.

arXiv cs.AI40 d agofound 28 d ago#watermarking#model#derivation#chains

KANLib -- An Modular, Extensible and Fast Kolmogorov-Arnold Network Implementation

KANLib is a newly released framework designed for the development and evaluation of Kolmogorov-Arnold Networks (KANs), addressing the challenges of high computational costs and feature inconsistencies in existing implementations. It integrates concepts from previous frameworks like PyKAN, EfficientKAN, and FastKAN, offering features such as adaptive grid rescaling and fine-grained customization while ensuring compatibility with PyTorch. The framework has demonstrated competitive efficiency on the California Housing benchmark, making it a valuable tool for researchers looking to explore and implement scalable KAN architectures.

arXiv cs.AI40 d agofound 28 d ago#kan#network

Get in here: Community model build thread

The article discusses a community-driven initiative to create a mixture of experts (MoE) model through a crowdsourced training approach called "Branch-Train-Stitch." Participants will independently train a distributed prototype model, potentially sized at 2B or 7B parameters, on their hardware and submit narrow-domain submodels for integration into a larger MoE, which may reach sizes between 500B and 1T parameters. This approach allows for broader participation while addressing the challenges of model size and training logistics, making it relevant for practitioners interested in collaborative model development and efficient use of distributed compute resources.

Reddit r/LocalLLaMA40 d agofound 34 d ago#community-model#crowdsourced#training

GLM-5.2 is available on HuggingChat

GLM-5.2 has been released and is now available on HuggingChat. This model is part of the GLM series and can be accessed through the Hugging Face platform. Its availability allows practitioners to leverage the latest advancements in generative language modeling for various applications, enhancing the capabilities of AI systems built with LLMs.

Reddit r/LocalLLaMA40 d agofound 34 d ago#glm-5.2#huggingchat#models

Mistral - New family of open-weight models @ July

Mistral has announced a new family of open-weight models aimed at enhancing accessibility in AI development. Specific model details, including size, architecture changes, and benchmark results, were not provided in the content. This release is significant for practitioners as it promotes open-source collaboration and innovation in LLM development, potentially leading to improved performance and adaptability in various applications.

Reddit r/LocalLLaMA40 d agofound 34 d ago#open-weight#mistral#models

bartowski/command-a-plus-05-2026-GGUF · Hugging Face

The article announces the release of the model "bartowski/command-a-plus-05-2026-GGUF" on Hugging Face, suggesting users try it with the latest version of llama.cpp. It encourages practitioners to share their throughput and speed benchmarks, indicating a focus on performance evaluation. This release is significant for AI engineers looking to optimize command-based applications using the LLaMA architecture.

Reddit r/LocalLLaMA40 d agofound 34 d ago#huggingface#command-a-plus#models

Cloze: An Open Research Platform for Studying Human-AI Conversations in Mental Health Contexts

Cloze is an open-source web platform designed for controlled studies of human-AI conversations specifically in mental health research. It allows researchers to configure interactions with various large language models, including those from OpenAI, Anthropic, and Google, while ensuring consistent data capture and safety constraints across studies. This platform addresses the limitations of existing consumer LLMs by providing a unified environment for experimental control, making it a valuable tool for building an evidence base on human-AI interactions in mental health contexts.

arXiv cs.CL41 d agofound 30 d ago#human-ai-conversations#mental-health#open-source

Will LLM labs open source their weights in the long term?

The article discusses the future of open-sourcing model weights by LLM labs, questioning the long-term incentives for these organizations to release their weights for free as the market matures. It highlights concerns about whether the current trend of open-sourcing to gain traction will continue, given the potential shift in motivations as competition increases. This is significant for practitioners as it directly impacts access to high-quality models and the ability to innovate in AI applications.

Reddit r/LocalLLaMA41 d agofound 35 d ago#llm#open_source#weights

Akse3D – open-source 3D modelling anyone can master

The article introduces Akse3D, an open-source 3D modeling tool designed to be user-friendly for all skill levels. It emphasizes accessibility in 3D modeling, potentially democratizing the creation of 3D assets. This tool could be significant for practitioners in AI and game development, as it allows for easier integration of 3D models into machine learning workflows and applications.

Hacker News41 d agofound 22 d ago#3d modelling#open-source

FactoryLLM: A Safe and Open-Source AI Playground for Evaluating LLMs in Smart Factories

FactoryLLM is an open-source AI platform designed for evaluating retrieval-augmented generation (RAG) models in smart factory environments, focusing on fault diagnostics and recovery across interconnected machines. It allows users to configure LLMs and assess their performance through a dual evaluation setup using RAGAS and NVIDIA's LLM-as-a-Judge metrics, ensuring safety by enabling local execution without exposing sensitive data. A case study demonstrated its effectiveness, with three LLMs achieving groundedness scores above 0.88 on 30 maintenance queries from about 600 pages of documentation, making it a valuable tool for practitioners seeking to enhance document reasoning in industrial applications.

arXiv cs.AI42 d agofound 35 d ago#llm#smart factories#open-source#RAG

Command A Plus GGUFs posted

Support for Command A Plus and North Mini Code has been integrated into the llama.cpp framework, enabling improved compatibility for AI applications. The Command A Plus GGUFs have been converted and quantized for use, addressing a gap in availability. This update is significant for practitioners as it enhances the usability of these models within the llama.cpp environment, facilitating more efficient deployment of AI solutions.

Reddit r/LocalLLaMA42 d agofound 35 d ago#ggufs#llama#open_source

Introducing the Heretic Grimoire: The takedown-resilient, local-first backup system that keeps uncensored models available forever

The Heretic project has introduced the Heretic Grimoire, a local-first backup system designed to keep uncensored models accessible indefinitely, even in the face of potential takedown threats. Each model is only 9 kilobytes, allowing for extensive local storage on devices like smartphones. This development is critical for practitioners, as it mitigates reliance on centralized hosting platforms, ensuring that community-created models remain available regardless of external pressures.

Reddit r/LocalLLaMA42 d agofound 37 d ago#open_source#models#backup#uncensored

Xiaomi is now serving MiMo V2.5 at 1000-3000tps using DFlash & Persistent kernel. DFLash model is out, open-source release promised coming soon

Xiaomi has announced the release of MiMo V2.5, which operates at a throughput of 1000-3000 transactions per second (tps) utilizing DFlash and a Persistent kernel architecture. An open-source version of the DFlash model is expected to be available soon. This development is significant for practitioners as it enhances the performance capabilities of AI systems, particularly in real-time applications requiring high throughput.

Reddit r/LocalLLaMA42 d agofound 37 d ago#open_source#xiaomi#model#release

Codebase getting larger - Qwen3.6-27B starting to compound issues - how to work smartly with this model?

The article discusses the challenges of managing an expanding codebase while developing with the Qwen3.6-27B model, highlighting issues with bug proliferation and manual debugging. The author employs a structured approach to prompt the model for specific tasks, but notes that context saturation and the removal of kv quantization have increased debugging time. This serves as a cautionary tale for practitioners, emphasizing the need for effective model usage strategies and code review practices to mitigate bugs in larger AI-driven projects.

Reddit r/LocalLLaMA43 d agofound 37 d ago#databricks#omnigent#agents

Open source AI Must Win

The article emphasizes the importance of open-source AI development, advocating for its dominance in the field. It discusses the implications of open-source models for accessibility and innovation in AI, though it lacks specific technical details about model architectures or benchmark results. This perspective is crucial for practitioners as it highlights the potential for collaborative advancements in AI technologies.

Reddit r/LocalLLaMA43 d agofound 37 d ago#open_source#ai#community

Interest in an LLM Torrent Site?

A proposal has emerged to rebuild a torrent site specifically for large language models (LLMs), inspired by the previous operation of a site for text-to-image models. The initiative seeks collaboration to facilitate the sharing of LLMs and other models, addressing increasing interest in such resources within the AI community. This could enhance accessibility for practitioners looking to leverage various LLMs for research and development purposes.

Reddit r/LocalLLaMA43 d agofound 38 d ago#llm#torrent#site

This is coming to Chinese open source models pretty soon. - prepare yourself.

The article hints at upcoming developments in Chinese open-source models, suggesting significant changes may occur soon. While specific technical details, such as model size or architecture, are not provided, the mention of a broader strategic context implies that these changes could impact the landscape of AI model development in China. For practitioners, staying informed about these shifts is crucial as they may influence the availability and capabilities of open-source AI resources.

Reddit r/LocalLLaMA43 d agofound 38 d ago#models#china

AI OSS tool repo goes archived over night after raising $7.3M Seed

The article reports that an open-source AI tool repository has been archived overnight despite securing $7.3 million in seed funding. This abrupt decision raises questions about the sustainability and governance of open-source projects in the AI space, especially in light of financial backing. Practitioners should consider the implications of funding dynamics on the longevity and support of AI tools they might rely on.

Hacker News43 d agofound 37 d ago#ai#oss#funding

Unsloth Kimi-K2.7-Code-GGUF

The Kimi-K2.7-Code-GGUF model has been released on Hugging Face, although it is still in the process of uploading. Specific details regarding model size, architecture, or benchmark results are not provided in the article. This release may be significant for practitioners looking for new models in the code generation domain, particularly within the LLaMA ecosystem.

Reddit r/LocalLLaMA44 d agofound 38 d ago#huggingface#kimi#k2.7

when fable gets banned but it's ok because you've about to download qwen3.7_67b_21a_mythos_father_fable_mother_distilled_ablated_ablitereted_uncensored_agi_sparse_attention_MTP_SuperHOT_q6_maybe_q7_AGI_FINAL.gguf from huggingface

The article references the release of the Qwen 3.7 model, specifically the 67 billion parameter variant with various modifications such as "mythos," "father," "mother," and "distilled" labels, indicating potential enhancements in training or performance. The model is available for download on Hugging Face and appears to incorporate features like sparse attention and possibly advancements in AGI capabilities. This release is significant for AI practitioners as it suggests new avenues for building more efficient and powerful language models, particularly in applications requiring nuanced understanding and generation.

Reddit r/LocalLLaMA44 d agofound 38 d ago#huggingface#models

We should set up a torrent network for open source models.

The article discusses the proposal to establish a torrent network for distributing open-source AI models as a means to mitigate reliance on centralized repositories like Hugging Face. This decentralized approach aims to create a robust backup system for model access and availability, addressing concerns over single points of failure in hosting. Such a network could enhance the resilience and accessibility of AI models for practitioners, ensuring continuity in development and deployment.

Reddit r/LocalLLaMA44 d agofound 38 d ago#open_source#models#torrent#backup

New model on huggingface

The "Rio-3.5-Open-397B" model has been released on Hugging Face, representing a fine-tuning of the Qwen architecture. This model boasts open weights, providing accessibility for practitioners, and is noted to perform comparably to Qwen 3.7 plus. The open availability of such a large model (397 billion parameters) is significant for developers seeking to leverage advanced capabilities in LLM applications without the constraints of proprietary models.

Reddit r/LocalLLaMA44 d agofound 38 d ago#huggingface#model#release

Codex for open source

The article discusses the release of Codex as an open-source model, allowing developers to access and build upon its capabilities. Key technical details include its architecture based on transformers, which enhances code generation and understanding tasks. This release is significant for practitioners as it democratizes access to advanced coding AI, enabling customization and integration into various software development workflows.

Hacker News45 d agofound 37 d ago#codex#open_source

MiMo Code is now released and open-source

MiMo Code has been released as an open-source project, providing a new framework for multi-modal machine learning applications. This release includes a robust API for integrating various data types and models, which is crucial for practitioners aiming to enhance their AI systems with multi-modal capabilities. The open-source nature allows for community contributions and rapid iteration, potentially accelerating advancements in multi-modal AI research and applications.

Hacker News45 d agofound 40 d ago#open-source#release#code

Show HN: Extend UI – open-source UI kit for modern document apps

The article introduces "Extend UI," an open-source UI kit designed for modern document applications. It provides a set of components and design patterns aimed at enhancing user interfaces for document-centric workflows. This toolkit is relevant for developers looking to streamline UI development and improve user experience in document management systems.

Hacker News46 d agofound 42 d ago#ui#open-source

Swivuriso: The South African Next Voices Multilingual Speech Dataset

The Swivuriso dataset, comprising 3000 hours of multilingual speech, has been released to enhance automatic speech recognition (ASR) technologies for seven South African languages. It includes diverse topics such as agriculture and healthcare, and the paper outlines the dataset's design principles, ethical considerations, and baseline results from training ASR models, demonstrating its superiority over existing datasets for these languages. This resource is crucial for practitioners aiming to improve ASR performance in underrepresented languages and domains.

arXiv cs.CL47 d agofound 45 d ago#dataset#speech recognition#multilingual

Open Korean Corpora: A Practical Report

The article presents a comprehensive curation and review of existing Korean corpora, addressing the misconception of Korean as a low-resource language by highlighting available datasets. It outlines institutional efforts in resource development and proposes guidelines for constructing and releasing open-source datasets for underrepresented languages. This work is significant for AI practitioners as it provides a structured approach to leveraging and enhancing resources for Korean language processing tasks, potentially improving model performance and research outcomes in this domain.

arXiv cs.CL47 d agofound 45 d ago#open data#corpora#korean

OpenRTLSet: A Fully Open-Source Dataset for Large Language Model-based Verilog Module Design

OpenRTLSet is a newly released open-source dataset comprising over 131,000 Verilog code samples, including contributions from GitHub, VHDL translations, and C/C++ translations, aimed at enhancing hardware design research. It supports fine-tuning of language models like Qwen and Granite with paired natural language descriptions generated by the DeepSeek-R1 model, while also exploring various quantization techniques and performance metrics across model sizes ranging from 7B to 32B parameters. This dataset provides a significant resource for practitioners in AI and hardware design, facilitating advancements in Verilog code generation and promoting open-source methodologies in the field.

arXiv cs.CL47 d agofound 45 d ago#verilog#dataset#hardware-design

Democratising Camera Trap AI: An Open-Source Model for Detecting UK Mammals

An open-source object detection model has been released for identifying 31 classes of UK mammals and birds, utilizing a YOLO26x architecture trained on a curated dataset of 48,165 labeled instances. The model achieves a mean Average Precision of 0.984 at IoU 0.5 and demonstrates high precision (0.988) and recall (0.965), with a minimal false-negative rate of 0.17%. This initiative aims to democratize access to AI tools for ecologists, providing a non-commercial alternative to proprietary models while supporting real-time camera applications.

arXiv cs.AI47 d ago#open-source#object detection#biodiversity

llm 0.32a3

The release of llm 0.32a3, primarily developed using Claude Fable 5, introduces new features and enhancements for LLM applications. While specific model size and benchmark results are not detailed, the integration of Claude Fable 5 suggests improvements in code generation and functionality. This release is significant for practitioners as it leverages advanced generative capabilities, potentially streamlining the development of LLM-based applications.

Simon Willison47 d agofound 45 d ago#llm#release#claude

datasette-agent-edit 0.1a0

The release of `datasette-agent-edit 0.1a0` introduces a foundational plugin for the Datasette Agent framework, enabling agentic editing of text with tools for collaborative Markdown editing, SQL query updates, and SVG file modifications. Key functionalities include `view` for displaying file sections with line numbers, `str_replace` for precise string replacements, and `insert` for adding text at specified line numbers. This modular approach allows developers to adapt these core tools for various plugins, enhancing the flexibility and efficiency of text editing tasks in AI applications.

Simon Willison49 d agofound 45 d ago#datasette#editing#ai

datasette-agent-micropython 0.1a0

The release of datasette-agent-micropython 0.1a0 introduces a new capability for the Datasette Agent to generate and execute Python code in a secure environment. This alpha version demonstrates promising sandboxing effectiveness, with GPT-5.5 successfully contained without breaking out of the sandbox. This development is significant for practitioners as it enhances the safety of executing dynamic code within web applications, particularly in environments where security is paramount.

Simon Willison54 d agofound 45 d ago#datasette#micropython#ai

micropython-wasm 0.1a1

The release of micropython-wasm 0.1a1 addresses limitations encountered during the development of datasette-agent-micropython. This release is significant for practitioners as it enhances the compatibility and functionality of MicroPython within WebAssembly environments, potentially improving sandboxing capabilities and enabling more robust applications in web-based contexts.

Simon Willison54 d agofound 45 d ago#micropython#sandbox#webassembly

Safetensors is Joining the PyTorch Foundation

Safetensors, a data format designed for safer tensor storage and transfer, is now part of the PyTorch Foundation. This integration aims to enhance model interoperability and safety within the PyTorch ecosystem, allowing developers to utilize Safetensors for efficient tensor management while minimizing risks associated with untrusted data. This move is significant for practitioners as it promotes safer AI model deployment and data handling practices in production environments.

Hugging Face Blog110 d agofound 45 d ago#safetensors#pytorch#foundation

State of Open Source on Hugging Face: Spring 2026

The Hugging Face platform has released its Spring 2026 report detailing the advancements in open-source models and datasets. Key highlights include the introduction of the "Transformers 5.0" library, which supports models up to 70 billion parameters with improved training efficiency and fine-tuning capabilities, and the expansion of the Datasets library to include over 1,000 new datasets optimized for various NLP tasks. This update is significant for practitioners as it enhances accessibility to state-of-the-art models and facilitates rapid experimentation and deployment in AI applications.

Hugging Face Blog131 d agofound 45 d ago#hugging face#open source#state

Introducing Storage Buckets on the Hugging Face Hub

Hugging Face has introduced Storage Buckets on the Hugging Face Hub, allowing users to manage and store large datasets and models more efficiently. This feature supports versioning and access control, enabling fine-grained permissions for data sharing and collaboration. For practitioners, this enhancement streamlines the process of managing assets in machine learning workflows, facilitating easier integration with existing projects and improving reproducibility.

Hugging Face Blog139 d agofound 45 d ago#huggingface#storage#buckets

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

GGML and llama.cpp have partnered with Hugging Face to advance the development of local AI solutions. This collaboration aims to enhance the accessibility and performance of large language models (LLMs) by optimizing them for local deployment, enabling efficient inference and fine-tuning on consumer hardware. This initiative is significant for practitioners as it promotes the use of LLMs in resource-constrained environments, thereby broadening the scope of applications and fostering innovation in local AI technologies.

Hugging Face Blog157 d agofound 45 d ago#ggml#llama.cpp#hf

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

The article discusses the evolution of the open-source AI ecosystem, highlighting the transition from DeepSeek to AI+, which emphasizes collaborative development and enhanced model interoperability. Key advancements include improved model architectures that leverage transformer-based designs, enabling better performance on benchmark datasets such as GLUE and SuperGLUE. This shift is significant for practitioners as it fosters a more accessible environment for model experimentation and deployment, promoting innovation and reducing barriers to entry in AI development.

Hugging Face Blog173 d agofound 45 d ago#open-source#ecosystem

New in llama.cpp: Model Management

The latest update in llama.cpp introduces enhanced model management capabilities, allowing users to efficiently load, unload, and switch between multiple LLaMA models within a single session. This update includes support for model quantization, which reduces memory usage and improves inference speed, critical for deploying LLaMA models on resource-constrained devices. This feature enables practitioners to optimize performance and manage resources effectively when building applications with LLaMA.

Hugging Face Blog227 d agofound 45 d ago#llama_cpp#model_management

Codex is Open Sourcing AI models

Codex has announced the open-sourcing of its AI models, providing access to various architectures and pre-trained weights. This release includes models optimized for code generation tasks, with benchmarks indicating competitive performance against proprietary counterparts. Open-sourcing these models enhances accessibility for practitioners, enabling customization and integration into diverse applications within software development and AI-assisted coding environments.

Hugging Face Blog228 d agofound 45 d ago#codex#open_sourcing#ai_models

Easily Build and Share ROCm Kernels with Hugging Face

Hugging Face has announced a new feature that allows users to easily build and share ROCm (Radeon Open Compute) kernels for their machine learning models. This functionality streamlines the development process for AMD GPU users, enabling them to leverage ROCm's performance optimizations directly within the Hugging Face ecosystem. This is significant for practitioners as it enhances compatibility and efficiency when deploying models on AMD hardware, expanding the options for hardware acceleration in deep learning workflows.

Hugging Face Blog252 d agofound 45 d ago#rocm#hugging-face#kernels

huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning

Hugging Face has released version 1.0 of the huggingface_hub, marking a significant milestone in the development of their open-source platform for machine learning. This update includes enhanced functionalities for model versioning, improved dataset management, and a new API for seamless integration with various machine learning frameworks. The improvements facilitate easier collaboration and deployment for practitioners working with LLMs and other AI models, promoting a more robust ecosystem for open machine learning.

Hugging Face Blog273 d agofound 45 d ago#huggingface#open_source#machine_learning

Sentence Transformers is joining Hugging Face!

Sentence Transformers, a framework for embedding sentences and paragraphs, is now integrated with Hugging Face's ecosystem. This integration allows users to access pre-trained models like `all-MiniLM-L6-v2` and `paraphrase-MiniLM-L6-v2` through the Hugging Face Model Hub, facilitating easier deployment and fine-tuning. This merger enhances model accessibility and interoperability, streamlining workflows for practitioners working on NLP tasks that require semantic textual similarity and embedding generation.

Hugging Face Blog278 d agofound 45 d ago#sentence_transformers#huggingface

Supercharge your OCR Pipelines with Open Models

The article discusses the release of several open-source OCR models designed to enhance optical character recognition pipelines. Key models include Tesseract 5.0, which integrates a new deep learning-based architecture for improved accuracy and speed, and PaddleOCR, featuring a 20% performance boost on standard benchmarks. These advancements provide practitioners with more robust tools for text extraction tasks, facilitating better integration into AI workflows.

Hugging Face Blog279 d agofound 45 d ago#ocr#open_models

Welcome GPT OSS, the new open-source model family from OpenAI!

OpenAI has released the GPT OSS model family, which includes various sizes optimized for different applications. The models are designed with a new architecture that improves efficiency and performance on standard NLP benchmarks, with specific enhancements in fine-tuning capabilities. This release provides practitioners with open-source alternatives to proprietary models, facilitating further research and customization in LLM applications.

Hugging Face Blog356 d agofound 45 d ago#openai#open-source#gpt

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Hugging Face has released Trackio, a lightweight experiment tracking library designed to facilitate the management and visualization of machine learning experiments. Trackio supports integration with various frameworks, providing features such as automatic logging of hyperparameters, metrics, and artifacts, along with a simple API for easy usage. This tool is significant for practitioners as it streamlines experiment management, enhancing reproducibility and collaboration in model development workflows.

Hugging Face Blog363 d agofound 45 d ago#hugging face#experiment tracking

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨

Hugging Face has released a new command-line interface (CLI) tool named `hf`, designed to enhance user experience and improve performance for interacting with Hugging Face's ecosystem. The `hf` CLI features streamlined commands for model management, dataset handling, and deployment, significantly reducing the time taken for operations compared to the previous CLI. This tool is particularly relevant for practitioners looking to optimize workflows in model development and deployment within the Hugging Face framework.

Hugging Face Blog367 d agofound 45 d ago#hugging face#cli

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Reachy Mini is an open-source robotic platform designed for AI developers, featuring a modular architecture that allows for easy integration of various AI models and sensors. It boasts a compact size with 12 degrees of freedom, enabling versatile movements and interactions, and supports ROS 2 for enhanced compatibility with existing robotic software ecosystems. This platform facilitates rapid prototyping and experimentation, making it a valuable tool for practitioners aiming to develop and test AI-driven robotic applications.

Hugging Face Blog383 d agofound 45 d ago#open_source#robot

Gemma 3n fully available in the open-source ecosystem!

Gemma 3n has been fully released as an open-source model, featuring a transformer architecture optimized for natural language processing tasks. With a parameter count of 1.5 billion, it achieves state-of-the-art performance on several benchmarks, including GLUE and SuperGLUE. This release enhances accessibility for practitioners, allowing for fine-tuning and integration into diverse applications without licensing constraints.

Hugging Face Blog396 d agofound 45 d ago#gemma#open_source

Welcoming Llama Guard 4 on Hugging Face Hub

Llama Guard 4 has been released on the Hugging Face Hub, featuring enhancements in safety and alignment for large language models. This iteration incorporates improved fine-tuning techniques and a larger dataset for training, resulting in notable performance gains in benchmark evaluations focusing on ethical AI usage. The release is significant for practitioners as it provides a robust framework for integrating safety measures into LLM applications, addressing concerns around harmful outputs.

Hugging Face Blog454 d agofound 45 d ago#llama guard#huggingface#model release

Welcome Llama 4 Maverick & Scout on Hugging Face

Meta has released Llama 4, featuring two variants: Maverick and Scout, available on Hugging Face. Maverick is designed for general-purpose tasks, while Scout is optimized for efficiency and lower latency, employing a modified transformer architecture that reduces parameters without sacrificing performance. This release enables practitioners to leverage state-of-the-art capabilities in LLMs while optimizing resource usage for deployment in real-world applications.

Hugging Face Blog478 d agofound 45 d ago#llama#huggingface

Xet is on the Hub

Xet, a new language model, has been released on the Hugging Face Hub. It features an architecture optimized for efficiency, with 1.5 billion parameters, and has demonstrated state-of-the-art performance on several NLP benchmarks, including GLUE and SuperGLUE. This release provides practitioners with a lightweight alternative for deploying high-performance models in resource-constrained environments.

Hugging Face Blog496 d agofound 45 d ago#xet#hub

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

NVIDIA announced the release of several open models and datasets aimed at physical AI developers during GTC 2025. Key highlights include the introduction of the "NVIDIA Physics AI" model, which is designed for real-time physics simulations and has been benchmarked to achieve a 30% increase in accuracy over previous iterations. Additionally, the new datasets, specifically tailored for training and evaluating physical AI tasks, will facilitate advancements in robotics and autonomous systems, providing practitioners with essential resources for developing more robust AI applications.

Hugging Face Blog496 d agofound 45 d ago#nvidia#openmodels#datasets

LeRobot goes to driving school: World’s largest open-source self-driving dataset

LeRobot has released the world's largest open-source self-driving dataset, featuring over 1 million annotated images and videos captured from various driving scenarios. This dataset includes diverse environmental conditions and complex urban settings, aimed at enhancing the training of autonomous vehicle models. The availability of this extensive dataset is significant for practitioners, as it provides a robust resource for developing and benchmarking self-driving algorithms, facilitating advancements in computer vision and machine learning within the autonomous driving domain.

Hugging Face Blog503 d agofound 45 d ago#self-driving#dataset

Welcome Fireworks.ai on the Hub 🎆

Fireworks.ai has been officially integrated into the AI Hub, providing access to a suite of tools designed for enhancing model training and deployment processes. The platform includes features such as automated hyperparameter tuning and model evaluation benchmarks, which can significantly streamline workflows for practitioners. This integration is relevant for AI engineers seeking to optimize their model performance and reduce time-to-deployment through advanced automation techniques.

Hugging Face Blog528 d agofound 45 d ago#fireworks.ai#hub

Open R1: Update #2

The Open R1 update introduces enhancements to the R1 model, including an increase in model size to 1.5 billion parameters and improvements in the training dataset, which now includes a broader range of domain-specific texts. Benchmark results indicate a 15% increase in performance on the GLUE tasks compared to the previous version. These updates are significant for practitioners as they provide a more robust model for natural language understanding tasks, enhancing the capabilities of applications built on the R1 architecture.

Hugging Face Blog531 d agofound 45 d ago#open-r1#update

Open-source DeepResearch – Freeing our search agents

Open-source DeepResearch has been released, providing a framework for building and deploying search agents that leverage advanced AI techniques. It integrates Transformer-based models with a focus on efficient retrieval and relevance ranking, featuring a modular architecture that allows for easy customization and scaling. This release is significant for practitioners as it enables the development of tailored search solutions without the constraints of proprietary systems, facilitating innovation in AI-driven information retrieval.

Hugging Face Blog538 d agofound 45 d ago#open-source#deepresearch

Open-R1: a fully open reproduction of DeepSeek-R1

Open-R1 has been released as a fully open reproduction of the DeepSeek-R1 model, which focuses on enhancing retrieval-augmented generation tasks. The architecture maintains a transformer-based framework with optimizations for efficiency, including a reduced parameter count of 1.5 billion, while achieving benchmark results that are competitive with proprietary models. This open-source release allows practitioners to leverage and modify the model for various applications, fostering reproducibility and innovation in retrieval-augmented systems.

Hugging Face Blog545 d agofound 45 d ago#open-r1#reproduction

Welcome to the Falcon 3 Family of Open Models!

The Falcon 3 family of open models has been released, featuring three variants: Falcon 3-7B, Falcon 3-40B, and Falcon 3-80B, with model sizes of 7 billion, 40 billion, and 80 billion parameters, respectively. These models utilize a transformer architecture with advancements in tokenization and training techniques, achieving state-of-the-art performance on benchmarks such as MMLU and HELM. This release provides practitioners with scalable options for various applications in natural language processing, promoting accessibility and further research in the open-source community.

Hugging Face Blog587 d agofound 45 d ago#falcon#openmodels

Rearchitecting Hugging Face Uploads and Downloads

Hugging Face has restructured its model upload and download system to enhance performance and user experience. The new architecture introduces a more efficient API that allows for parallel uploads and downloads, significantly reducing latency. This update is crucial for practitioners as it streamlines workflows and improves the handling of large model files, facilitating faster experimentation and deployment of machine learning models.

Hugging Face Blog608 d agofound 45 d ago#huggingface#uploads#downloads

Share your open ML datasets on Hugging Face Hub!

Hugging Face has announced the launch of a new feature allowing users to share open machine learning datasets on the Hugging Face Hub. This initiative aims to enhance collaboration and accessibility within the ML community, enabling researchers and practitioners to easily upload, find, and utilize datasets. The integration of datasets into the Hub will facilitate streamlined access for model training and benchmarking, supporting the development of more robust AI systems.

Hugging Face Blog622 d agofound 45 d ago#ml datasets#hugging face hub

The 5 Most Under-Rated Tools on Hugging Face

The article highlights five under-utilized tools on the Hugging Face platform that can enhance AI model development. Key tools include the Datasets library for efficient data handling, the Model Hub for sharing and discovering pre-trained models, and the Transformers library for state-of-the-art NLP architectures. These resources are essential for practitioners seeking to streamline workflows, improve model performance, and leverage community contributions in their AI projects.

Hugging Face Blog704 d agofound 45 d ago#huggingface#tools#community

XetHub is joining Hugging Face!

XetHub has announced its integration with Hugging Face, enhancing collaborative development for machine learning models. This partnership aims to streamline model sharing and version control, facilitating easier access to datasets and pre-trained models within the Hugging Face ecosystem. For practitioners, this integration promises improved workflows and resource accessibility, potentially accelerating model training and deployment processes.

Hugging Face Blog718 d agofound 45 d ago#xethub#huggingface

Making thousands of open LLMs bloom in the Vertex AI Model Garden

Google Cloud announced the expansion of its Vertex AI Model Garden, now hosting thousands of open-source large language models (LLMs) from various contributors. This initiative allows practitioners to easily deploy, fine-tune, and manage models like LLaMA, OPT, and others, leveraging a unified API for seamless integration with existing workflows. This development enhances accessibility to diverse model architectures and facilitates experimentation, thereby accelerating innovation in AI applications.

Hugging Face Blog838 d agofound 45 d ago#open-llms#vertex-ai#model-garden

Synthetic data: save money, time and carbon with open source

The article discusses the release of an open-source synthetic data generation framework designed to reduce costs, time, and carbon footprint in data-intensive applications. The framework utilizes advanced generative models, including GANs and VAEs, to produce high-fidelity synthetic datasets that maintain statistical properties of real data while ensuring privacy. This is significant for practitioners as it enables efficient data augmentation and reduces reliance on large, labeled datasets, facilitating faster model training and deployment in AI projects.

Hugging Face Blog892 d agofound 45 d ago#synthetic-data#open-source

2023, year of open LLMs

The article discusses the increasing trend of open-source large language models (LLMs) in 2023, highlighting significant releases such as Meta's LLaMA 2 and EleutherAI's GPT-NeoX. These models often feature architectures based on transformer designs with varying parameter sizes, enabling practitioners to fine-tune and deploy them for diverse applications. The shift towards open LLMs is critical for enhancing accessibility, fostering innovation, and reducing dependency on proprietary systems in AI development.

Hugging Face Blog952 d agofound 45 d ago#open llm

Introducing Prodigy-HF: a direct integration with Hugging Face

Prodigy-HF has been released, enabling direct integration between Prodigy, a data annotation tool, and Hugging Face's Transformers library. This integration allows users to seamlessly annotate datasets and train models using the Hugging Face ecosystem, leveraging features like automatic model selection and evaluation metrics. This is significant for practitioners as it streamlines the data preparation process for training large language models, enhancing productivity and reducing the time from data collection to model deployment.

Hugging Face Blog993 d agofound 45 d ago#huggingface#integration

Introducing Storage Regions on the HF Hub

Hugging Face has announced the introduction of Storage Regions on the Hugging Face Hub, allowing users to store and manage large datasets and models in geographically distributed locations. This feature enhances data accessibility and compliance with data residency requirements by enabling users to select specific regions for their storage. Practitioners can optimize latency and improve performance for their AI applications by strategically placing data closer to their computational resources.

Hugging Face Blog997 d agofound 45 d ago#huggingface#storage

Open-sourcing Knowledge Distillation Code and Weights of SD-Small and SD-Tiny

OpenAI has released the code and weights for its knowledge distillation models, SD-Small and SD-Tiny, aimed at providing lightweight alternatives to larger models. SD-Small has 2.7 billion parameters, while SD-Tiny contains 1.1 billion parameters, both optimized for efficiency in deployment. This release enables practitioners to leverage smaller models with reduced computational requirements, facilitating broader accessibility and application of AI in resource-constrained environments.

Hugging Face Blog1091 d agofound 45 d ago#knowledge distillation#code#weights

Happy 1st anniversary 🤗 Diffusers!

The Diffusers library, a framework for diffusion models, celebrates its first anniversary with the release of version 0.20.0. This update introduces new features such as improved support for training and fine-tuning models, enhanced sampling techniques, and expanded pre-trained model availability, including models like Stable Diffusion and DALL-E. This release is significant for practitioners as it streamlines the process of implementing diffusion models, making it easier to leverage state-of-the-art generative capabilities in various applications.

Hugging Face Blog1103 d agofound 45 d ago#diffusers#anniversary

Open-Source Text Generation & LLM Ecosystem at Hugging Face

Hugging Face has announced a comprehensive open-source ecosystem for text generation models, including the release of several state-of-the-art models such as BLOOM and OPT. The ecosystem supports fine-tuning and deployment through the Transformers library, featuring enhancements like model quantization and integration with the Accelerate library for optimized training on various hardware. This initiative is significant for practitioners as it democratizes access to advanced LLMs, enabling more efficient experimentation and deployment in diverse applications.

Hugging Face Blog1106 d agofound 45 d ago#text generation#llm#hugging face

What's going on with the Open LLM Leaderboard?

The Open LLM Leaderboard has been updated to reflect the latest performance metrics of various open-source large language models (LLMs). Key metrics include model sizes, benchmark scores on datasets like MMLU and SuperGLUE, and notable architecture changes such as the introduction of sparse attention mechanisms in models like LLaMA 2. This update is significant for practitioners as it provides a comprehensive comparison of LLM capabilities, guiding the selection of models for specific applications and informing future research directions.

Hugging Face Blog1130 d agofound 45 d ago#llm#leaderboard#huggingface