Large Language Model Training

Qwen3-Coder-Next offers vibe coders a powerful open source, ultra-sparse model with 10x higher throughput for repo tasks

On SWE-Bench Verified, the model achieved a score of 70.6%. This performance is notably competitive when placed alongside ...

Communications of the ACMOpinion

From Model Training to Model Raising

A call to reform AI model-training paradigms from post hoc alignment to intrinsic, identity-based development.

Are Large Language Models A Dead End Or Simply Incomplete?

Once a model is deployed, its internal structure is effectively frozen. Any real learning happens elsewhere: through retraining cycles, fine-tuning jobs or external memory systems layered on top. The ...

Fast Company

OpenAI unveils its new GPT-4.5 large language model

OpenAI released a new base model on Thursday called GPT-4.5, which the company said is its best and smartest model for chat yet. It’s not a reasoning model like OpenAI’s o1 and o3 models, but it can ...

Communications of the ACM

The Swiss LLM Apertus

Apertus was released in early September 2025. It is a multilingual model developed by the Swiss Federal Institutes of Technology in Zurich (ETH) and Lausanne (EPFL). The model was pretrained with 60% ...

The Conversation

Large language models: how the AI behind the likes of ChatGPT actually works

Mark Stevenson has previously received funding from Google. The arrival of AI systems called large language models (LLMs), like OpenAI’s ChatGPT chatbot, has been heralded as the start of a new ...

The Economist

Forget DeepSeek. Large language models are getting cheaper still

As recently as 2022, just building a large language model (LLM) was a feat at the cutting edge of artificial-intelligence (AI) engineering. Three years on, experts are harder to impress. To really ...

The Hacker News

Microsoft Develops Scanner to Detect Backdoors in Open-Weight Large Language Models

Microsoft develops a lightweight scanner that detects backdoors in open-weight LLMs using three behavioral signals, improving ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results