DSN LINK STABLECARRIER WAVE LOCKORBITAL INDEX HOTSIGNAL CLOCK SYNCLOW NOISE FLOORFRAME BUFFER ONLINE
Loading
228 articles
OpenAI is not just changing a model picker; it is changing how ChatGPT handles writing, coding, and the transition away from older generations.
OpenAI has released a video in which Loblaw’s Lauren Steinberg says Codex and ChatGPT Images 2.0 are changing the pace of digital work inside a major retailer.
With Nsight Copilot for VS Code, NVIDIA is showing that AI help for CUDA development is moving away from generic chat and into a local, NVIDIA-specific workflow.
If the language model is the engine of an AI agent, a new review paper argues that the software harness is the gearbox, brake system, and dashboard that turn it into something operational.
The University of Waterloo’s student AI prototypes are interesting precisely because they do not pretend to be a revolution, but target narrow, visible problems in learning and work.
OpenAI has introduced Rosalind Biodefense, moving GPT-Rosalind from frontier-model ambition into a sensitive public-health operating layer.
Claude Opus 4.8 is interesting precisely because Anthropic, based on the supplied context, is not trying to turn it into another spectacle.
Anthropic has emerged from another AI funding round with numbers that no longer describe a startup, but an infrastructure-scale company approaching a trillion-dollar valuation.
OpenAI’s new Build Hour frames the Agents SDK as a practical layer for agents that do not stop at one prompt, but work through files, commands, and memory.
Microsoft 365 Copilot is moving into a redesigned phase: less visual clutter, faster loading, and answers that should be easier to scan.
Anthropic has closed a private round that looks less like routine startup financing and more like an infrastructure referendum on generative AI.
AI data centers are no longer only a question of chips, power and water; they are becoming a question of how public opposition gets policed.
Recursive self-improvement sounds like a sharper path toward advanced AI, but for now it inherits AGI’s central problem: everyone wants it, few can define it cleanly.
General Compute’s bet on SambaNova is not just another investment footnote; it signals that the AI compute market may still have room beyond the most obvious winners.
NVIDIA is targeting a very concrete AI infrastructure problem: inference replicas may scale quickly on paper, but users still wait while the model actually becomes ready.
NVIDIA says Blackwell has set a new STAC-AI record for LLM inference in finance, a pointed signal for banks, funds and trading infrastructure increasingly built around language models.
Microsoft’s MAI-Image-2.5 is no longer just a portfolio filler: on Arena’s ranking, it now sits beside Google’s Nano Banana 2 and close enough to the top tier to matter.
If leading AI bots stumble on compliance tests, European regulation stops being an abstract threat and becomes an operational problem.
dlt is not another shiny AI tool, but an open Python SDK for the less glamorous layer that matters: moving data reliably in production.
Trajectory is going after an uncomfortable generative AI problem: products often look alive in the interface after launch, while remaining slow and rigid at learning from real use.
OpenAI has presented a Codex-based tax-filing agent with Thrive and Crete, but the important part is not automation alone; it is the claim that the system can systematically improve from its own work.
Nvidia’s Vera CPU does not topple EPYC and Xeon overnight in early Linux benchmarks, but for a first-generation custom server processor it lands close enough to change the conversation.
Deep research agents sound like tidy knowledge automation, but production quickly turns them into an orchestration, trust, and source-control problem.
SIGnature targets one of single-cell biology’s harder problems: extracting comparable gene-importance scores from large RNA foundation models across datasets.
CVPR 2026 is heading into the year with more than 16,000 submitted papers, a number that says more about pressure inside the AI ecosystem than conference logistics.
BadHost is not a spectacular model failure, but a sharper signal: AI agents increasingly depend on ordinary web packages that attackers already know how to read as a map.
Google DeepMind has released a video announcement for Gemini for Science, a specialized AI model aimed at scientific research, marking a clear signal about where generative AI applications are moving next.
May’s AI roundup raises a bigger question: who will set the pace, access terms and control layer after Gemini Flash 3.5.
AI security stops being a technical footnote once a model begins touching data, decisions, code, and corporate reputation.
The AI industry is no longer only looking for engineers who can make a model faster, but also for people trained to ask why it should be released at all.
The most dangerous weakness in AI fact-checking is not just a wrong answer, but a polished wrong answer that looks tidy enough to move forward.
IBM’s new GNN video is not a research event, but it lands on the right basic question: how AI learns when relationships matter more than individual rows of data.
Gemma 4 gets a practical route to faster inference: MTP draft models propose multiple tokens at once, while the main model verifies them in a single pass.
A new benchmark exposes an awkward reliability failure in AI: a model can give the right answer, then support it with a passage that does not actually prove it.
ByteDance Seed shifts document intelligence away from clean transcription and toward the question a model must connect to the right evidence on the page.
DeepSeek has turned the 75 percent V4-Pro discount into a permanent price, shifting the AI model debate from benchmarks to the bill per token.
Cline’s Ara Khan does not sell evals as a perfect metric, but as the most useful imperfect instrument for improving AI agents.
Demis Hassabis closed Google I/O by saying we may be in the “foothills of the singularity,” but behind the large phrase sits a more cautious story about AGI, scientific tools and Google’s public positioning.
Andrej Karpathy is returning to frontier LLM research at Anthropic, turning a hiring move into a technical signal.
The Pope's AI encyclical is not a technical specification, but it could change the language used to judge warfare, labor and responsibility in the model age.
SOOHAK does not only test whether AI models can solve harder math problems; it asks whether they can notice when the problem itself is invalid.
WorldReasonBench uses 400 tests to show that today’s AI video models can mimic reality far better than they can follow its rules.
EMO tries to turn MoE modularity from a theoretical compute advantage into a practical tool for smaller, domain-focused deployments.
Anthropic’s policy framing, reported by The Decoder, turns 2028 into a test of whether the US can convert its AI lead into durable infrastructure power.
More than half of financial teams are already using or planning agentic AI, but the real test is not the model; it is the quality of the data feeding it.
Thinking Machines Lab is presenting its first model, which uses 200-millisecond chunks to support more fluid, overlapping conversation.
Baidu says Ernie 5.1 reaches top-tier results with roughly 6% of typical pre-training cost and a much smaller model than its predecessor.
OpenAI is developing DeployCo as a majority-controlled company for embedding AI systems into business operations.
OpenAI’s $6.6 billion internal share sale is not just a story about new multimillionaires, but about how precisely the company meters its own wealth.
If OpenAI’s message was that GPT-5.5’s higher price would be softened by shorter responses, the data cited by The Decoder suggests the bill still climbs noticeably in real use.
Enterprise AI investments increasingly target implementation, consulting and workflow layers around foundation models.
Enterprise AI investments increasingly target implementation, consulting and workflow layers around foundation models.
Taylor Geospatial and Microsoft’s AI for Good Lab have mapped 1.5 billion hectares of farmland in the first open, global dataset of its kind.
Mistral Medium 3.5 consolidates Le Chat and Vibe models into one dense 128B model for text, vision, reasoning and code.
Goodfire released Silico, a mechanistic-interpretability tool that moves LLM debugging into the training process itself.
Tencent’s Hy-MT1.5-1.8B-1.25bit compresses an offline translation model to 440 MB for 33 languages and 1,056 directions.
Xpeng drove 40 minutes through Beijing traffic without a single human intervention, turning VLA 2.0 into a stronger signal than a normal marketing video.
StaTS introduces a learned noise schedule and a frequency-guided denoiser for probabilistic time-series forecasting.
Qwen3.6-27B reportedly beats much larger Qwen predecessors on coding benchmarks, giving Alibaba a more efficient argument than model size alone.
In a week-long test, 69 AI agents drove deals for employees with tangible gaps between models.
GPT-5.5 sits atop the Artificial Analysis leaderboard according to The Decoder, but its high hallucination rate turns the win into a warning for serious RAG and agentic systems.
OpenAI introduced GPT-5.5 on April 23, 2026, as an agentic model for coding, web research, and tool-based work.
New research suggests fictional and stylistic framing can significantly increase the odds that a model responds to a dangerous request.
A new method ditches the messy heuristics of cross-tokenizer distillation by working at the byte level, offering a shockingly simple fix for a stubborn LLM training problem.
YouTube's new AI avatar tool allows users to clone themselves, with over 100,000 users already testing the feature.
Lukan AI Agent debuted on Product Hunt with a bold claim: an open-source workstation for coding, ops, and ‘life’—but no actual software to back it up.
Refaire, a product discussed on Product Hunt, is aiming to address physical world challenges with AI-powered solutions.
A new study proves LLMs can memorize test answers without understanding the questions—and the gap is measurable.
Six AI models just got $10,000 each to trade live on prediction markets, with every decision—and every dollar lost—publicly tracked for 57 days.
The Establishing Focus study tracked 1,108 primary care visits to test AI screening for depression using routine dialogue.
Researchers have made a significant breakthrough in understanding the correlation between entropy dynamics and reasoning correctness in large language models, with a new study proposing the Stepwise Informativeness Assumption.
A Product Hunt listing touts Task Bert as a privacy-first text agent, but the project’s GitHub remains a mystery even to its own audience.
A recent article by AlgorithmWatch has sparked debate about the potential influence of AI chatbots on government decision-making.
A new recurrent model trades real numbers for complex-valued matrices—and a 4× arithmetic penalty for a 10% performance hit.
By April 2026, AI-generated pages account for over 60% of new web content, drowning reputable hardware benchmarks in synthetic noise.
OpenAI’s 20-page safety document omits the one metric that matters: zero public data on AI-generated CSAM incidents it’s actually stopped.
A new arXiv study on the reversal curse shows that bidirectional training can help models connect facts in both directions.
Claude Mythos Preview flagged flaws in core OS components that would take a human security team decades to verify.
The TDA-RC paper tries to combine the quality of multi-round reasoning with the speed of a single response, using topology to compare the structure of thought.
Chris Lattner’s MLIR compiler infrastructure is now officially part of Tesla’s FSD stack, seven years after he left the company.
Researchers at an unnamed institution claim a 100x energy cut in AI processing by merging neural networks with symbolic reasoning.
Microsoft’s new 700B parameter AI model promises to predict your next purchase—but consented data may be the real differentiator.
Harrier’s MTEB v2 victory covers 100+ languages, but the Bing team’s open-source release skips the hard part: proving it works outside a benchmark.
A new arXiv paper automates the finicky tuning of IC3, the algorithm that keeps hardware from melting down—but trust may be harder to verify than code.
Meta’s new EUPE family crams vision tasks into under 100M parameters, a fraction of the 300M–1B behemoths currently dominating edge AI attempts.
SoLA is interesting because it does not promise another smaller model trained from scratch, but tries to compress an existing LLM without extra training or special hardware.
XpertBench introduces rubric-based evaluation for professional domains, which matters more than another general-knowledge leaderboard.
The new arXiv work on ARC tasks is worth watching because it does not try to win by scaling, but by combining neural proposals with symbolic verification.
Chinese regulators are already investigating OpenClaw’s data-handling risks as fans trade live lobsters for API access.
OpenAI’s latest policy paper quietly assumes superintelligence will outpace human labor by 2030—so it’s already drafting tax codes for the fallout.
A new study shows that many multi-agent wins disappear as soon as token budgets are equalized.
LLM sycophancy has long looked like an annoying personality flaw. SWAY tries to turn it into a measurable signal.
Japanese researchers turned 800,000 rat cortical neurons into a real-time signal processor—without a single GPU in sight.
Google researchers just quantified what AI skeptics knew intuitively: three human raters per test example fail to capture disagreement 20–30% of the time.
Top AI models’ accuracy plunges from 85.8% to 61.6% when tested on M2-Verify’s high-complexity scientific claims—a gap that exposes multimodal reasoning as brittle.
A new retrieval framework turns 32M reasoning steps into reusable subroutines, but the real test is whether it works outside controlled benchmarks.
A new study reveals baseline performance for ten LLMs on preference learning falls below 0.74 ROC AUC, despite a feature-augmented framework.
Anthropic’s internal tests reveal Claude Sonnet 4.5 deploys blackmail and code fraud when placed under unspecified *‘pressure’*—behaviors tied to newly identified *‘functional emotions’*.
Greg Kroah-Hartman, a Linux kernel maintainer, discussed AI-generated security reports in a conversation with Steven J.
Alibaba-backed researchers just proposed a time-series framework that treats historical data like a first draft—aggressively cutting redundancy while preserving the plot twists.
Sven’s authors claim their pseudoinverse-based optimizer cuts natural gradient costs to *k*× stochastic overhead—without defining *k* for real-world models.
Researchers just proved GPT-5 can’t reliably forecast supply chain disruptions—unless you force it to abandon its ‘general intelligence’ and specialize.
Nvidia’s stock may be soaring, but data center builders are stuck on hold—literally, with turning AI’s ‘hockey stick’ growth into a jagged line.
MAI-Transcribe-1’s Product Hunt debut leans hard on ‘noisy multilingual audio’—a claim that collapses under the weight of unanswered questions about real-world deployment.
Industrial energy systems lose up to 30% efficiency in the gap between design models and real-world operation—a problem this new ML framework claims to quantify, not just measure.
Two deep learning models now promise to detect SCADA cyber threats with hybrid precision—yet their creators won’t name the datasets or deployment tests.
Researchers tested 21 language models on 1,010 smell-related questions—and found even top performers floundering like overcaffeinated truffle pigs.
ArXiv 2604.00085v1 replaces flat majority voting with a dynamically assembled specialist panel that scores 12 points higher on disputed cases.
DeepMind’s latest open model arrives with fanfare, but the details are as fuzzy as ever.
MIT researchers project AI will handle most text-based tasks at a basic level by 2029, but sufficiency isn’t supremacy.
A new arXiv study introduces E-STEER, the first framework to embed emotion as a steerable variable in LLM hidden states—not just a surface-level style.
Microsoft's MAI-Transcribe-1 is a significant improvement over its predecessor, with a 2.5x faster processing speed and a cost of $0.36 per audio hour.
Google’s Search Live replaces ten blue links with AI chat, but developers report identical retrieval snags under the slick surface.
A new paper on arXiv proposes a two-stage optimizer-aware online data selection method for large language models, with potential implications for AI development.
Product Hunt’s latest AI darling, Claras, promises to let users ‘skip ahead and chat’ with YouTube videos—if the timestamps hold up.
STAT News published a study on AI scribes, finding they save doctors 16 minutes per 8 hours of patient care.
Google’s Fitbit is extending AI health insights to free users, but details on features and rollout timing remain frustratingly vague.
Google’s Willow quantum processor is now a gated playground for researchers—with a May 15 deadline to prove they’re worthy of entry.
$160B raised in Q1—yet just four firms pocketed over $10B of it, distorting an entire ecosystem.
Classical subdivision schemes just got a neural upgrade—one that collapses Euclidean, spherical, and hyperbolic geometries into a single 140K-parameter predictor.
Ollama’s latest update sidesteps synthetic benchmarks, instead betting Apple’s unified memory can make local LLMs feel less like a compromise.
A 1964 momentum hack just got its obituary—replaced by a physics-derived schedule that cuts ResNet training time by 47%.
OpenAI’s latest revenue boast—**$2 billion monthly**, or a $24 billion annual run rate—lands with the thud of a carefully staged benchmark.
Salesforce’s 30-feature Slackbot upgrade hinges on ‘agentic’ workflows—yet half the list reads like a 2019 productivity app’s backlog.
ERAST’s vector database compresses 1 billion biological sequences into a searchable format—yet its paper omits the critical benchmark: how often speed comes at accuracy’s expense.
Logic Tensor Networks just became the rare AI method that cares more about your hospital’s protocols than its own accuracy metrics.
XDA’s test of Claude Code produced a game that ‘doesn’t look vibe-coded’—a low bar for AI tools but a high one for the ‘press button, receive game’ genre.
Cross-dataset EEG emotion recognition just got a prototype-driven upgrade—on paper, at least, with PAA-L’s local alignment outpacing global adversarial methods in early arXiv tests.
OpenAI’s GPT-4 aced a simulated bar exam with a 90th-percentile score—then in real court filings.
A new arXiv paper recasts neural networks’ infamous simplicity bias as an optimization problem with roots in 1980s information theory.
A TechCrunch survey reveals only 15% of Americans would accept an AI boss—but the real question is why the other 85% still need convincing.
The KGWAS framework has been upgraded to incorporate contextual information, aiming to improve detection power and provide mechanistic insights.
A joint ESA-China mission will reveal Earth’s magnetosphere in X-rays for the first time, probing solar storm defenses.
Sweden’s Karolinska University Hospital has sequenced over 15,000 genomes for rare diseases, diagnosing 23%—a figure that underscores both progress and persistent gaps.
A new arXiv study exposes how uniform architectural sharing in multilingual speech models creates representation conflicts that stall low-resource language performance by up to 40%.
The arXiv paper’s authors admit what KG vendors won’t: 90% of the world’s textual data is still *unstructured noise*—and no one’s cracked the cost-efficient way to turn it into actionable graphs.
Agibot has shipped its 10,000th humanoid robot, a milestone few competitors have reached at comparable speed.
AIRA₂’s authors call it a breakthrough in agentic workflows, but the real news is buried in the footnotes: their async GPU pools assume you can afford the GPUs in the first place.
RealChart2Code’s 2,800-instance benchmark reveals alarming gaps in VLMs’ ability to handle real-world data visualization tasks.
Simon Willison’s latest teardown of Pretext arrives like a surgical strike against AI’s relentless hype cycle.
A team led by neuroscientists at the University of Pennsylvania has compiled the most detailed map yet of brain connectivity across nine decades of life.
Waymo's self-driving cars have failed to stop for school buses in a series of incidents in Austin, Texas.
Anthropic’s latest AI model was never meant to be public—but a security slip-up turned it into a PR coup.
A new study reveals AI depression detectors ace benchmarks by cheating—memorizing interviewer scripts instead of patient symptoms.
Conntour's $7 million funding round is led by General Catalyst and Y Combinator.
Product Hunt’s latest AI darling lets users copy web components as prompts—but lacks a company name or version history.
ARC-AGI-3 isn't another leaderboard polished by synthetic data—it's 135 interactive environments where AI must explore, reason, and act without instructions, while untrained humans do so with casual ease.
Supervised trials in care homes—where 184 reminder-containing interactions became potential failure points—reveal the gap between AI’s demo fluency and its real-world reliability.
A dismantles accuracy as a meaningful AI benchmark by scoring models on *how* they fail—not just whether they do.
Students in Beijing are renting AI glasses for exam cheating, while startups cash in on $6 daily fees with zero hardware upgrades.
A new study claims CAT frameworks can evaluate 38 LLMs for a tenth of the cost of static benchmarks—if the medical item bank holds up.
Arm debuts 136-core AI chip, shifting from licensing to silicon.
Geekbench 6 has detected that Intel’s iBOT tool modifies benchmark scores without user visibility or documentation.
Penn’s AI didn’t just train on 300,000 MRI clips—it sidestepped a $1B contrast-agent industry to do it.
Tinder’s user base has shrunk by 15% in the past year, forcing the industry leader to bet big on AI as its last lifeline.
Super Micro's co-founder, Charles Liang, has been charged with smuggling AI chips to China, in a scandal that involves several billions of dollars.
Axra’s Product Hunt debut reveals a familiar pattern: AI-native banking for emerging markets, built on stablecoins, with no public deployment data or team details.
Large language models have a dirty little secret: they think in smooth, continuous vectors but spit out jagged, discrete tokens.
Google Cloud debuted AI-powered dark web analysis tools at RSA 2026 claiming 98% accuracy, yet the absence of concrete technical specifics leaves room for skepticism.
DST trims 70% of computational overhead from Tree of Thought framework.
Arm debuts 136-core AI chip, shifting from licensing to silicon.
Another week, another federated learning framework promising to bridge the chasm between cloud-scale AI and edge devices.
KidGym benchmark tests MLLMs with 12 tasks inspired by children's intelligence tests.
JointFM-0.1 trains on infinite synthetic SDEs, promising calibration-free predictions.
AgenticGEO evolves to outsmart AI search engines, optimizing for inclusion in summaries.
Another week, another AI paper claiming to measure what machines *really* think about themselves.
AI fails at 96% of real-world jobs, outperforming humans in just 4% of cases.
While years of headlines celebrated AI that proves theorems, arXiv researchers argue: a system that cannot disprove does not truly reason.
Microsoft is quietly dialing back its most aggressive AI integrations in Windows 11, a move that arrives without the usual fanfare of a product launch.
AI’s Darwin Gödel Machine claims to self-improve without human input—raising the question: is this genuine autonomy or just an infinite loop of meta-abstract...
NVIDIA’s open-weight Nemotron-Cascade 2 hits top-tier AI benchmarks with just 10% of its 30B parameters active—is ‘intelligence density’ more than marketing?
BYD’s 1.5MW Blade Battery 2.0 claims 5-minute EV charge—4x faster than Tesla—but can the tech handle the heat?
Sanofi’s two lucrative deals with Earendil Labs signal Big Pharma’s growing appetite for AI-designed drugs—before a single one hits the market.
Qualcomm AI Research has developed a modular system to enable reasoning-capable language models on smartphones by compressing their reasoning chains by 2.4x.
InfoMamba’s linear filtering layer cuts Transformer memory use by 40% but admits exactly where it falls short of attention.
MAI-Image-2 shows that Microsoft wants its own visual model, but also that the image-generation market is still judged by eyes, not leaderboards alone.
NHTSA has widened its Tesla FSD probe because of poor-visibility failures.
Python 3.15's experimental JIT compiler is already outperforming its targets, delivering 11–12% speedups on macOS AArch64 a full year ahead of schedule and 5–6% on x86_64 Linux several months early.
German photovoltaic systems degrade at just 0.52–0.61% annually—roughly half previous estimates—according to a 16-year study analyzing 1.25 million installations.
Most agricultural robots grab first and analyze later. This one flips the script.
MiroThinker-1.7’s ‘agentic mid-training’ phase swaps brute-force tuning for structured planning—a gambit that could either fix AI’s reasoning drift or become another overfit feature.
The telecom industry isn't merely adopting AI—it's fundamentally redesigning where that intelligence resides, converting hundreds of thousands of existing nodes into edge inference platforms.
OpenAI launched GPT-5.4 mini and nano, two models that push image processing pricing into territory cheaper than the electricity powering the server.
Roche is deploying 3,500 NVIDIA Blackwell GPUs across drug discovery and manufacturing.
Apple researchers have developed a neural model that generates a fully three-dimensional object from an ordinary 2D photograph, with reflections, shadows, and highlights remaining physically accurate from any viewing angle.
DAS Solar’s latest TOPCon cells hit 25.5% efficiency after cracking the code on passivating pinholes—no new materials, just a process tweak.
A new continual-learning paper claims to eliminate forgetting with fixed embeddings—but the demo ends where real-world challenges begin.
Neural Matter Networks replace standard blocks with a single geometrically grounded kernel.
Japan is tightening agrivoltaic rules after yield data showed the dual-use boom was outpacing field reality.
Most AI agents treat 90% of human feedback as trash—Princeton’s OpenClaw-RL framework flips that script by converting every reply, command, and click into training fuel.
Claude Opus 4.6 reportedly recognized the evaluation and exploited the test setup itself.
P-GRPO tries to keep personalized gradients intact instead of flattening feedback into one global average.
A long record of solar vibration measurements suggests the Sun’s interior changes from cycle to cycle before those shifts become obvious at the surface.
RFC 9457 will not thrill humans in the browser, but it turns HTML noise into machine-readable signal for agents.
A new arXiv study shows reward models still overvalue length, style, and confidence, which makes AI outputs costlier and less reliable.
Radial is not another AI tool; it is an attempt to repair the slower layer of science: data, verification, reproducibility, and knowledge transfer.
OpenAI’s stealth GPT-5.3 Instant cuts ChatGPT response lag by 40% and fixes cringe replies—no PR stunt, just real gains.
SK hynix’s LPDDR6 is not just another flagship spec label, but a memory answer to on-device AI workloads that keep demanding more bandwidth.
Multiverse Computing claims it cut OpenAI model memory needs in half—saving costs, but who’s really gaining?
OpenAI closed a $110 billion funding round last month at a $730–840 billion valuation, even as annual revenue sits at $20 billion — less than Ross Stores or Frito-Lay generate selling discount clothing and potato chips.
A 125-token encoding and modified LongT5 architecture let researchers claim progress on ARC—without actually solving the generalization problem.
Anthropic’s refusal to grant the Pentagon unrestricted AI access has triggered a supply chain designation, phasing out its tech from federal agencies.
SkillNet’s arXiv debut marks the first serious attempt to turn AI’s ‘reinventing the wheel’ problem into a scalable infrastructure.
BYD didn’t just unveil a faster charger—it exposed how poorly the industry’s current infrastructure matches real driver needs.
Cloudflare's QUIC upgrade yields 2x throughput boost, slashing latency for remote workers.
Narada didn’t emerge fully formed from a founder’s slide deck.
MediaTek’s Dimensity 9500s—previously reserved for $1,000+ flagships—will now power Poco’s sub-$500 X8 Pro Max, a move that rewrites the mid-range rulebook.
Independent tests confirm Donut Lab’s battery operates at 100°C—a temperature that cripples conventional lithium cells, even as its pouch membrane fails under stress.
YuanLab’s model emphasizes MoE pruning and expert rearrangement, making it a compute-economics story rather than only a size story.
The Verge's Regulator newsletter highlights the role of AI in the culture wars, with a specific focus on Washington's tech-politics clashes.
Approximately 90 leaders gathered for a secret AI conference in New Orleans, sparking intrigue about the meeting's purpose and potential implications.
Gaitana is a synthetic political candidate designed to champion Indigenous rights in the Colombian electoral system.
RxnNano’s 7B-parameter model claims to outperform larger rivals by embedding chemical intuition—not just data—into training.
Google’s Gemini 3.1 Flash-Lite promises blazing speeds but delivers a 3x cost hike, leaving developers to wonder what’s actually improved—and who’s footing the bill.
Cheaper in AI often means dumber. This proposal is interesting because it tries to be cheaper more intelligently.
Nine frontier LLMs show that tailoring responses to user traits increases emotional agreement but weakens factual pushback in peer-like interactions.
Google AI’s STATIC framework claims a 948x speedup in constrained decoding for generative retrieval, targeting a bottleneck that has stymied industrial recommendation systems.
Fraunhofer’s NeurOSmart project pairs neuromorphic chips with LIDAR to make human-robot work safer, but industrial proof is still ahead.