AI 每日资讯 - 2026-03-13

发布日期：2026-03-13

收录条目：20

1. How to Build an Autonomous Machine Learning Research Loop in Google Colab Using Andrej Karpathy’s AutoResearch Framework for Hyperparameter Discovery and Experiment Tracking

来源：MarkTechPost
发布时间：2026-03-12 22:46 UTC
链接：https://www.marktechpost.com/2026/03/12/how-to-build-an-autonomous-machine-learning-research-loop-in-google-colab-using-andrej-karpathys-autoresearch-framework-for-hyperparameter-discovery-and-experiment-tracking/

摘要：In this tutorial, we implement a Colab-ready version of the AutoResearch framework originally proposed by Andrej Karpathy. We build an automated experimentation pipeline that clones the AutoResearch repository, prepares

2. Stanford Researchers Release OpenJarvis: A Local-First Framework for Building On-Device Personal AI Agents with Tools, Memory, and Learning

来源：MarkTechPost
发布时间：2026-03-12 21:21 UTC
链接：https://www.marktechpost.com/2026/03/12/stanford-researchers-release-openjarvis-a-local-first-framework-for-building-on-device-personal-ai-agents-with-tools-memory-and-learning/

摘要：Stanford researchers have introduced OpenJarvis, an open-source framework for building personal AI agents that run entirely on-device. The project comes from Stanford’s Scaling Intelligence Lab and is presented as both a

3. Improve operational visibility for inference workloads on Amazon Bedrock with new CloudWatch metrics for TTFT and Estimated Quota Consumption

来源：AWS ML Blog
发布时间：2026-03-12 21:20 UTC
链接：https://aws.amazon.com/blogs/machine-learning/improve-operational-visibility-for-inference-workloads-on-amazon-bedrock-with-new-cloudwatch-metrics-for-ttft-and-estimated-quota-consumption/

摘要：Today, we’re announcing two new Amazon CloudWatch metrics for Amazon Bedrock, TimeToFirstToken and EstimatedTPMQuotaUsage. In this post, we cover how these work and how to set alarms, establish baselines, and proactively

4. Secure AI agents with Policy in Amazon Bedrock AgentCore

来源：AWS ML Blog
发布时间：2026-03-12 21:16 UTC
链接：https://aws.amazon.com/blogs/machine-learning/secure-ai-agents-with-policy-in-amazon-bedrock-agentcore/

摘要：In this post, you will understand how Policy in Amazon Bedrock AgentCore creates a deterministic enforcement layer that operates independently of the agent's own reasoning. You will learn how to turn natural language des

5. Facebook Marketplace adds AI auto-replies for annoying ‘Is this still available?’ messages

来源：The Verge AI
发布时间：2026-03-12 17:59 UTC
链接：https://www.theverge.com/tech/893907/facebook-marketplace-ai-auto-reply-listings

摘要：Facebook Marketplace is adding a bunch of new AI-powered tools that are supposed to make selling items on the platform a little more efficient. One feature will use Meta AI to automatically respond to those annoying "Is

6. Gemini’s task automation is here and it’s wild

来源：The Verge AI
发布时间：2026-03-12 16:59 UTC
链接：https://www.theverge.com/tech/893820/gemini-task-automation-samsung-s26-google-pixel-10

摘要：A couple of weeks ago, Google and Samsung announced a big Gemini development coming to their newest devices: task automation. Starting with food delivery and rideshare apps, Gemini would be able to use certain apps on yo

7. Anthropic’s Claude AI can respond with charts, diagrams, and other visuals now

来源：The Verge AI
发布时间：2026-03-12 16:00 UTC
链接：https://www.theverge.com/ai-artificial-intelligence/893625/anthropic-claude-ai-charts-diagrams

摘要：Anthropic's latest update to Claude will allow the AI chatbot to generate custom charts, diagrams, and other visualizations during your conversation. If Claude determines a visual is useful based on the context of your c

8. Multimodal embeddings at scale: AI data lake for media and entertainment workloads

来源：AWS ML Blog
发布时间：2026-03-12 15:59 UTC
链接：https://aws.amazon.com/blogs/machine-learning/multimodal-embeddings-at-scale-ai-data-lake-for-media-and-entertainment-workloads/

摘要：This post shows you how to build a scalable multimodal video search system that enables natural language search across large video datasets using Amazon Nova models and Amazon OpenSearch Service. You will learn how to mo

9. Fine-tuning NVIDIA Nemotron Speech ASR on Amazon EC2 for domain adaptation

来源：AWS ML Blog
发布时间：2026-03-12 15:57 UTC
链接：https://aws.amazon.com/blogs/machine-learning/fine-tuning-nvidia-nemotron-speech-asr-on-amazon-ec2-for-domain-adaptation/

摘要：In this post, we explore how to fine-tune a leaderboard-topping, NVIDIA Nemotron Speech Automatic Speech Recognition (ASR) model; Parakeet TDT 0.6B V2. Using synthetic speech data to achieve superior transcription result

10. Anthropic doesn’t trust the Pentagon, and neither should you

来源：The Verge AI
发布时间：2026-03-12 14:00 UTC
链接：https://www.theverge.com/podcast/893370/anthropic-pentagon-ai-mass-surveillance-nsa-privacy-spying

摘要：Today we’re talking about the messy, fast-moving situation at Anthropic, the maker of Claude that now finds itself in a very ugly legal battle with the Pentagon. The back-and-forth is complicated, but as of a few days ag

11. Bespoke AI models are the next big thing in filmmaking

来源：The Verge AI
发布时间：2026-03-12 13:56 UTC
链接：https://www.theverge.com/streaming/893538/ai-model-netflix-interpositive-ben-affleck

摘要：Though many AI boosters have convinced themselves that the technology can spit out films and television series whole cloth, claims of Hollywood being cooked feel very premature when you see what people are making with th

12. Microsoft’s Copilot Health can connect to your medical records and wearables

来源：The Verge AI
发布时间：2026-03-12 13:01 UTC
链接：https://www.theverge.com/tech/893594/microsoft-copilot-health-launch

摘要：Microsoft announced on Thursday that it's launching Copilot Health, a "separate, secure space" in Copilot for asking questions about lab results and medical records, searching for providers, analyzing data from wearables

13. You can now ask Google Maps ‘complex, real-world questions’ — and Gemini will answer

来源：The Verge AI
发布时间：2026-03-12 12:30 UTC
链接：https://www.theverge.com/tech/893262/google-maps-gemini-ai-ask-maps-immersive-navigation

摘要：Google is continuing to weave Gemini into the firmament of its most-used products. Today, it announced that Google Maps was getting a new AI-powered "Ask Maps" feature that allows for "complex, real-world questions" with

14. Perplexity’s Personal Computer turns your spare Mac into an AI agent

来源：The Verge AI
发布时间：2026-03-12 12:00 UTC
链接：https://www.theverge.com/ai-artificial-intelligence/893536/perplexitys-personal-computer-turns-your-spare-mac-into-an-ai-agent

摘要：Perplexity wants to be more than just an answer engine. On Wednesday, it launched Personal Computer, a new AI agent tool that can turn a spare Mac into a locally run AI system, pitching it as "a digital proxy for you." P

15. Agentic Control Center for Data Product Optimization

来源：arXiv cs.AI
发布时间：2026-03-12 04:00 UTC
链接：https://arxiv.org/abs/2603.10133

摘要：arXiv:2603.10133v1 Announce Type: new Abstract: Data products enable end users to gain greater insights about their data by providing supporting assets, such as example question-SQL pairs which can be answered using the

16. Hybrid Self-evolving Structured Memory for GUI Agents

来源：arXiv cs.AI
发布时间：2026-03-12 04:00 UTC
链接：https://arxiv.org/abs/2603.10291

摘要：arXiv:2603.10291v1 Announce Type: new Abstract: The remarkable progress of vision-language models (VLMs) has enabled GUI agents to interact with computers in a human-like manner. Yet real-world computer-use tasks remain

17. HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation

来源：arXiv cs.AI
发布时间：2026-03-12 04:00 UTC
链接：https://arxiv.org/abs/2603.10359

摘要：arXiv:2603.10359v1 Announce Type: new Abstract: Distilling reasoning capabilities from Large Reasoning Models (LRMs) into smaller models is typically constrained by the limitation of rejection sampling. Standard methods

18. Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability

来源：arXiv cs.AI
发布时间：2026-03-12 04:00 UTC
链接：https://arxiv.org/abs/2603.10384

摘要：arXiv:2603.10384v1 Announce Type: new Abstract: Evaluating LLM reliability via scalar probabilities often fails to capture the structural dynamics of reasoning. We introduce TRACED, a framework that assesses reasoning qu

19. Verbalizing LLM's Higher-order Uncertainty via Imprecise Probabilities

来源：arXiv cs.AI
发布时间：2026-03-12 04:00 UTC
链接：https://arxiv.org/abs/2603.10396

摘要：arXiv:2603.10396v1 Announce Type: new Abstract: Despite the growing demand for eliciting uncertainty from large language models (LLMs), empirical evidence suggests that LLM behavior is not always adequately captured by t

20. Resource-constrained Amazons chess decision framework integrating large language models and graph attention

来源：arXiv cs.AI
发布时间：2026-03-12 04:00 UTC
链接：https://arxiv.org/abs/2603.10512

摘要：arXiv:2603.10512v1 Announce Type: new Abstract: Artificial intelligence has advanced significantly through the development of intelligent game-playing systems, providing rigorous testbeds for decision-making, strategic p

菜单

分享

AI 每日资讯 - 2026-03-13

1. How to Build an Autonomous Machine Learning Research Loop in Google Colab Using Andrej Karpathy’s AutoResearch Framework for Hyperparameter Discovery and Experiment Tracking

2. Stanford Researchers Release OpenJarvis: A Local-First Framework for Building On-Device Personal AI Agents with Tools, Memory, and Learning

3. Improve operational visibility for inference workloads on Amazon Bedrock with new CloudWatch metrics for TTFT and Estimated Quota Consumption

4. Secure AI agents with Policy in Amazon Bedrock AgentCore

5. Facebook Marketplace adds AI auto-replies for annoying ‘Is this still available?’ messages

6. Gemini’s task automation is here and it’s wild

7. Anthropic’s Claude AI can respond with charts, diagrams, and other visuals now

8. Multimodal embeddings at scale: AI data lake for media and entertainment workloads

9. Fine-tuning NVIDIA Nemotron Speech ASR on Amazon EC2 for domain adaptation

10. Anthropic doesn’t trust the Pentagon, and neither should you

11. Bespoke AI models are the next big thing in filmmaking

12. Microsoft’s Copilot Health can connect to your medical records and wearables

13. You can now ask Google Maps ‘complex, real-world questions’ — and Gemini will answer

14. Perplexity’s Personal Computer turns your spare Mac into an AI agent

15. Agentic Control Center for Data Product Optimization

16. Hybrid Self-evolving Structured Memory for GUI Agents

17. HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation

18. Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability

19. Verbalizing LLM's Higher-order Uncertainty via Imprecise Probabilities

20. Resource-constrained Amazons chess decision framework integrating large language models and graph attention

评论

A2A 初理解：让 AI Agent 真正“互相协作”的通用协议

slow op的排查手段（更新中）

asan内存检测

模型即芯片：AI 推理新分叉

rclone拷贝桶对象失败定位过程

训练初了解：把大模型看成一个复杂函数（通俗版）

vector扩容

智能指针是线程安全的？

ceph中 RBD 使用

cas 无锁编程