Language Modeling - 搜索 News

Microsoft’s Phi-3 shows the surprising power of small, locally run AI language models

On Tuesday, Microsoft announced a new, freely available lightweight AI language model named Phi-3-mini, which is simpler and less expensive to operate than traditional large language models (LLMs) ...

MIT Technology Review

Anthropic can now track the bizarre inner workings of a large language model

What the firm found challenges some basic assumptions about how this technology really works. The AI firm Anthropic has developed a way to peer inside a large language model and watch what it does as ...

SiliconANGLE

Hugging Face open-sources world’s smallest vision language model

Hugging Face Inc. today open-sourced SmolVLM-256M, a new vision language model with the lowest parameter count in its category. The algorithm’s small footprint allows it to run on devices such as ...

Live Science

Large language models not fit for real-world use, scientists warn — even slight changes ...

Large language model AIs might seem smart on a surface level but they struggle to actually understand the real world and model it accurately, a new study finds. When you purchase through links on our ...

VentureBeat

Zyphra debuts Zyda, a 1.3T language modeling dataset it claims outperforms Pile, C4, arxiv

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Zyphra Technologies is announcing the ...

Wired

Small Language Models Are the New Rage, Researchers Say

The original version of this story appeared in Quanta Magazine. Large language models work well because they’re so large. The latest models from OpenAI, Meta, and DeepSeek use hundreds of billions of ...

Quanta Magazine

To Make Language Models Work Better, Researchers Sidestep Language

Language isn’t always necessary. While it certainly helps in getting across certain ideas, some neuroscientists have argued that many forms of human thought and reasoning don’t require the medium of ...

The Conversation

Large language models: how the AI behind the likes of ChatGPT actually works

Mark Stevenson has previously received funding from Google. The arrival of AI systems called large language models (LLMs), like OpenAI’s ChatGPT chatbot, has been heralded as the start of a new ...

Quanta Magazine

Why Language Models Are So Hard To Understand

You don’t typically build a machine without understanding how it works. But for artificial intelligence researchers building large language models, understanding is about the one thing they haven’t ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果