发布日期:2026-06-19
收录条目:20
1. Monitor and debug generative AI inference with SageMaker detailed metrics and Insights dashboard on CloudWatch
- 来源:AWS ML Blog
- 发布时间:2026-06-18 23:31 UTC
- 链接:https://aws.amazon.com/blogs/machine-learning/monitor-and-debug-generative-ai-inference-with-sagemaker-detailed-metrics-and-insights-dashboard-on-cloudwatch/
摘要:Amazon SageMaker AI provides fully managed real-time inference hosting for machine learning models. You deploy a model to a SageMaker endpoint backed by one or more compute instances, and SageMaker handles provisioning a
2. Perplexity Launches Brain, a Self-Improving Memory System That Builds a Context Graph of an Agent’s Work and Learns Overnight
- 来源:MarkTechPost
- 发布时间:2026-06-18 20:26 UTC
- 链接:https://www.marktechpost.com/2026/06/18/perplexity-launches-brain/
摘要:Perplexity has launched Brain, a self-improving memory system for its Computer agent. Instead of remembering the user, Brain remembers the agent's work — what worked, what failed, and what corrections got made. It builds
3. Amazon Bedrock AgentCore harness is now generally available: Go from idea to production-grade agent in minutes
- 来源:AWS ML Blog
- 发布时间:2026-06-18 17:32 UTC
- 链接:https://aws.amazon.com/blogs/machine-learning/amazon-bedrock-agentcore-harness-is-now-generally-available-go-from-idea-to-production-grade-agent-in-minutes/
摘要:Today, Amazon Bedrock AgentCore harness is generally available. Two API calls (CreateHarness to define an agent, and InvokeHarness to run it), and you have an agent running in seconds. The agent runs in its own isolated
4. New usage analytics and updated spend controls for enterprises
- 来源:OpenAI News
- 发布时间:2026-06-18 17:00 UTC
- 链接:https://openai.com/index/chatgpt-enterprise-spend-controls
摘要:OpenAI introduces new spend controls and usage analytics for ChatGPT Enterprise, helping organizations manage costs and scale AI with confidence.
5. Amazon employees say they’re facing termination for backing data center limits
- 来源:The Verge AI
- 发布时间:2026-06-18 16:00 UTC
- 链接:https://www.theverge.com/ai-artificial-intelligence/952180/amazon-seattle-data-center-moratorium-aecj-disciplinary-action
摘要:When three Amazon software engineers testified earlier this month at Seattle City Council hearings about data centers, they started their testimony by citing a city law barring employment discrimination over political sp
6. Who decides when AI is too dangerous?
- 来源:The Verge AI
- 发布时间:2026-06-18 14:00 UTC
- 链接:https://www.theverge.com/podcast/951542/anthropic-claude-fable-5-mythos-ban-pentagon-ai-regulation-trump
摘要:On today’s episode of Decoder, my guest is Hayden Field, senior AI reporter for The Verge. Often when Hayden comes on the show, it’s because something has gone wrong in the world of AI. Last weekend, that something was a
7. Photoshop and Premiere now have AI assistants
- 来源:The Verge AI
- 发布时间:2026-06-18 13:00 UTC
- 链接:https://www.theverge.com/tech/952099/adobe-ai-assistants-photoshop-premiere-illustrator-beta-launch
摘要:Adobe's plan to stick AI assistants into all of its Creative Cloud suite is now fully underway, with new chatbots now rolling out to its biggest editing and design apps. As part of a public beta launching today, Photosho
8. Adobe’s redesigned AI studio remembers what your creations look like
- 来源:The Verge AI
- 发布时间:2026-06-18 13:00 UTC
- 链接:https://www.theverge.com/tech/952104/adobe-firefly-ai-agent-elements-projects-update
摘要:Adobe is introducing some new capabilities for its Firefly AI assistant, alongside a "reimagined" AI studio that lets you edit and generate new designs from a single interface. The new Firefly experience launching today
9. Improving health intelligence in ChatGPT
- 来源:OpenAI News
- 发布时间:2026-06-18 11:00 UTC
- 链接:https://openai.com/index/improving-health-intelligence-in-chatgpt
摘要:Learn how GPT-5.5 Instant improves ChatGPT’s health and wellness responses with stronger reasoning, better context, clearer communication, and physician-informed evaluations.
10. The KV Cache Compression Race: TurboQuant vs OSCAR vs EpiCache
- 来源:MarkTechPost
- 发布时间:2026-06-18 09:14 UTC
- 链接:https://www.marktechpost.com/2026/06/18/the-kv-cache-compression-race-turboquant-vs-oscar-vs-epicache/
摘要:The KV cache now outweighs model weights at long context. Here's how TurboQuant, OSCAR, and EpiCache each attack that memory bottleneck — and why they're more complementary than competitive. The post The KV Cache Compres
11. Using AI to help physicians diagnose rare genetic diseases affecting children
- 来源:OpenAI News
- 发布时间:2026-06-18 08:00 UTC
- 链接:https://openai.com/index/diagnose-rare-childhood-diseases
摘要:Researchers used an OpenAI reasoning model to help diagnose rare diseases, identifying 18 new diagnoses in previously unsolved cases.
12. NAVI-Orbital: First In-Orbit Demonstration of a Zero-Shot Vision-Language Model for Autonomous Earth Observation
- 来源:arXiv cs.AI
- 发布时间:2026-06-18 04:00 UTC
- 链接:https://arxiv.org/abs/2606.18271
摘要:arXiv:2606.18271v1 Announce Type: new Abstract: As Earth Observation data generation outpaces downlink bandwidth and human-in-the-loop processing, a widening gap has emerged between onboard collection and actionable grou
13. CaVe-VLM-CoT: An Interpretable Vision-Language Model Framework
- 来源:arXiv cs.AI
- 发布时间:2026-06-18 04:00 UTC
- 链接:https://arxiv.org/abs/2606.18385
摘要:arXiv:2606.18385v1 Announce Type: new Abstract: Vision-Language Models (VLMs) remain prone to hallucinations, producing fluent but visually unfaithful outputs. Existing chain-of-thought and retrieval-augmented methods on
14. Searching for Synergy in Shared Workspace Human-AI Collaboration
- 来源:arXiv cs.AI
- 发布时间:2026-06-18 04:00 UTC
- 链接:https://arxiv.org/abs/2606.18413
摘要:arXiv:2606.18413v1 Announce Type: new Abstract: Automated AI agents are increasingly capable, yet many scientific and professional tasks require human judgment and contextual expertise. We study shared-workspace human-AI
15. CEO-Bench: Can Agents Play the Long Game?
- 来源:arXiv cs.AI
- 发布时间:2026-06-18 04:00 UTC
- 链接:https://arxiv.org/abs/2606.18543
摘要:arXiv:2606.18543v1 Announce Type: new Abstract: Language model agents are becoming proficient executors at isolated, short-horizon tasks such as software engineering and customer service. Yet real-world challenges requir
16. DeFAb: A Verifiable Benchmark for Defeasible Abduction in Foundation Models
- 来源:arXiv cs.AI
- 发布时间:2026-06-18 04:00 UTC
- 链接:https://arxiv.org/abs/2606.18557
摘要:arXiv:2606.18557v1 Announce Type: new Abstract: A rule-based logic solver resolves every instance in our benchmark in under 50 microseconds with 100% accuracy; the best frontier language model reaches 65% at best and dro
17. Optimizing Lithium Production Decisions under Geological, Demand, and Pricing Uncertainties: A POMDP Framework for Multi-Objective Decision Making
- 来源:arXiv cs.AI
- 发布时间:2026-06-18 04:00 UTC
- 链接:https://arxiv.org/abs/2606.18598
摘要:arXiv:2606.18598v1 Announce Type: new Abstract: Decision making in lithium production is challenging, whether from an investor's perspective or a strategic production standpoint. Determining which mines to open and when
18. ForecastBench-Sim: A Simulated-World Forecasting Benchmark
- 来源:arXiv cs.AI
- 发布时间:2026-06-18 04:00 UTC
- 链接:https://arxiv.org/abs/2606.18686
摘要:arXiv:2606.18686v1 Announce Type: new Abstract: Forecasting benchmarks for general-purpose AI systems usually inherit the constraints of the real world: outcomes resolve slowly, tail events are rare, and counterfactual q
19. What Must Generalist Agents Remember?
- 来源:arXiv cs.AI
- 发布时间:2026-06-18 04:00 UTC
- 链接:https://arxiv.org/abs/2606.18746
摘要:arXiv:2606.18746v1 Announce Type: new Abstract: This paper develops a formal account of what generalist agents must store in memory in order to act near-optimally across multiple environments and goals. It shows that whe
20. R2D-RL: A RoboCup 2D Soccer Environment for Multi-Agent Reinforcement Learning
- 来源:arXiv cs.AI
- 发布时间:2026-06-18 04:00 UTC
- 链接:https://arxiv.org/abs/2606.18786
摘要:arXiv:2606.18786v1 Announce Type: new Abstract: Robot soccer is a challenging testbed for multi-agent reinforcement learning because it combines partial observability, cooperative and adversarial interaction, sparse rewa