发布日期:2026-03-13
收录条目:20
1. How to Build an Autonomous Machine Learning Research Loop in Google Colab Using Andrej Karpathy’s AutoResearch Framework for Hyperparameter Discovery and Experiment Tracking
- 来源:MarkTechPost
- 发布时间:2026-03-12 22:46 UTC
- 链接:https://www.marktechpost.com/2026/03/12/how-to-build-an-autonomous-machine-learning-research-loop-in-google-colab-using-andrej-karpathys-autoresearch-framework-for-hyperparameter-discovery-and-experiment-tracking/
摘要:In this tutorial, we implement a Colab-ready version of the AutoResearch framework originally proposed by Andrej Karpathy. We build an automated experimentation pipeline that clones the AutoResearch repository, prepares
2. Stanford Researchers Release OpenJarvis: A Local-First Framework for Building On-Device Personal AI Agents with Tools, Memory, and Learning
- 来源:MarkTechPost
- 发布时间:2026-03-12 21:21 UTC
- 链接:https://www.marktechpost.com/2026/03/12/stanford-researchers-release-openjarvis-a-local-first-framework-for-building-on-device-personal-ai-agents-with-tools-memory-and-learning/
摘要:Stanford researchers have introduced OpenJarvis, an open-source framework for building personal AI agents that run entirely on-device. The project comes from Stanford’s Scaling Intelligence Lab and is presented as both a
3. Improve operational visibility for inference workloads on Amazon Bedrock with new CloudWatch metrics for TTFT and Estimated Quota Consumption
- 来源:AWS ML Blog
- 发布时间:2026-03-12 21:20 UTC
- 链接:https://aws.amazon.com/blogs/machine-learning/improve-operational-visibility-for-inference-workloads-on-amazon-bedrock-with-new-cloudwatch-metrics-for-ttft-and-estimated-quota-consumption/
摘要:Today, we’re announcing two new Amazon CloudWatch metrics for Amazon Bedrock, TimeToFirstToken and EstimatedTPMQuotaUsage. In this post, we cover how these work and how to set alarms, establish baselines, and proactively
4. Secure AI agents with Policy in Amazon Bedrock AgentCore
- 来源:AWS ML Blog
- 发布时间:2026-03-12 21:16 UTC
- 链接:https://aws.amazon.com/blogs/machine-learning/secure-ai-agents-with-policy-in-amazon-bedrock-agentcore/
摘要:In this post, you will understand how Policy in Amazon Bedrock AgentCore creates a deterministic enforcement layer that operates independently of the agent's own reasoning. You will learn how to turn natural language des
5. Facebook Marketplace adds AI auto-replies for annoying ‘Is this still available?’ messages
- 来源:The Verge AI
- 发布时间:2026-03-12 17:59 UTC
- 链接:https://www.theverge.com/tech/893907/facebook-marketplace-ai-auto-reply-listings
摘要:Facebook Marketplace is adding a bunch of new AI-powered tools that are supposed to make selling items on the platform a little more efficient. One feature will use Meta AI to automatically respond to those annoying "Is
6. Gemini’s task automation is here and it’s wild
- 来源:The Verge AI
- 发布时间:2026-03-12 16:59 UTC
- 链接:https://www.theverge.com/tech/893820/gemini-task-automation-samsung-s26-google-pixel-10
摘要:A couple of weeks ago, Google and Samsung announced a big Gemini development coming to their newest devices: task automation. Starting with food delivery and rideshare apps, Gemini would be able to use certain apps on yo
7. Anthropic’s Claude AI can respond with charts, diagrams, and other visuals now
- 来源:The Verge AI
- 发布时间:2026-03-12 16:00 UTC
- 链接:https://www.theverge.com/ai-artificial-intelligence/893625/anthropic-claude-ai-charts-diagrams
摘要:Anthropic's latest update to Claude will allow the AI chatbot to generate custom charts, diagrams, and other visualizations during your conversation. If Claude determines a visual is useful based on the context of your c
8. Multimodal embeddings at scale: AI data lake for media and entertainment workloads
- 来源:AWS ML Blog
- 发布时间:2026-03-12 15:59 UTC
- 链接:https://aws.amazon.com/blogs/machine-learning/multimodal-embeddings-at-scale-ai-data-lake-for-media-and-entertainment-workloads/
摘要:This post shows you how to build a scalable multimodal video search system that enables natural language search across large video datasets using Amazon Nova models and Amazon OpenSearch Service. You will learn how to mo
9. Fine-tuning NVIDIA Nemotron Speech ASR on Amazon EC2 for domain adaptation
- 来源:AWS ML Blog
- 发布时间:2026-03-12 15:57 UTC
- 链接:https://aws.amazon.com/blogs/machine-learning/fine-tuning-nvidia-nemotron-speech-asr-on-amazon-ec2-for-domain-adaptation/
摘要:In this post, we explore how to fine-tune a leaderboard-topping, NVIDIA Nemotron Speech Automatic Speech Recognition (ASR) model; Parakeet TDT 0.6B V2. Using synthetic speech data to achieve superior transcription result
10. Anthropic doesn’t trust the Pentagon, and neither should you
- 来源:The Verge AI
- 发布时间:2026-03-12 14:00 UTC
- 链接:https://www.theverge.com/podcast/893370/anthropic-pentagon-ai-mass-surveillance-nsa-privacy-spying
摘要:Today we’re talking about the messy, fast-moving situation at Anthropic, the maker of Claude that now finds itself in a very ugly legal battle with the Pentagon. The back-and-forth is complicated, but as of a few days ag
11. Bespoke AI models are the next big thing in filmmaking
- 来源:The Verge AI
- 发布时间:2026-03-12 13:56 UTC
- 链接:https://www.theverge.com/streaming/893538/ai-model-netflix-interpositive-ben-affleck
摘要:Though many AI boosters have convinced themselves that the technology can spit out films and television series whole cloth, claims of Hollywood being cooked feel very premature when you see what people are making with th
12. Microsoft’s Copilot Health can connect to your medical records and wearables
- 来源:The Verge AI
- 发布时间:2026-03-12 13:01 UTC
- 链接:https://www.theverge.com/tech/893594/microsoft-copilot-health-launch
摘要:Microsoft announced on Thursday that it's launching Copilot Health, a "separate, secure space" in Copilot for asking questions about lab results and medical records, searching for providers, analyzing data from wearables
13. You can now ask Google Maps ‘complex, real-world questions’ — and Gemini will answer
- 来源:The Verge AI
- 发布时间:2026-03-12 12:30 UTC
- 链接:https://www.theverge.com/tech/893262/google-maps-gemini-ai-ask-maps-immersive-navigation
摘要:Google is continuing to weave Gemini into the firmament of its most-used products. Today, it announced that Google Maps was getting a new AI-powered "Ask Maps" feature that allows for "complex, real-world questions" with
14. Perplexity’s Personal Computer turns your spare Mac into an AI agent
- 来源:The Verge AI
- 发布时间:2026-03-12 12:00 UTC
- 链接:https://www.theverge.com/ai-artificial-intelligence/893536/perplexitys-personal-computer-turns-your-spare-mac-into-an-ai-agent
摘要:Perplexity wants to be more than just an answer engine. On Wednesday, it launched Personal Computer, a new AI agent tool that can turn a spare Mac into a locally run AI system, pitching it as "a digital proxy for you." P
15. Agentic Control Center for Data Product Optimization
- 来源:arXiv cs.AI
- 发布时间:2026-03-12 04:00 UTC
- 链接:https://arxiv.org/abs/2603.10133
摘要:arXiv:2603.10133v1 Announce Type: new Abstract: Data products enable end users to gain greater insights about their data by providing supporting assets, such as example question-SQL pairs which can be answered using the
16. Hybrid Self-evolving Structured Memory for GUI Agents
- 来源:arXiv cs.AI
- 发布时间:2026-03-12 04:00 UTC
- 链接:https://arxiv.org/abs/2603.10291
摘要:arXiv:2603.10291v1 Announce Type: new Abstract: The remarkable progress of vision-language models (VLMs) has enabled GUI agents to interact with computers in a human-like manner. Yet real-world computer-use tasks remain
17. HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation
- 来源:arXiv cs.AI
- 发布时间:2026-03-12 04:00 UTC
- 链接:https://arxiv.org/abs/2603.10359
摘要:arXiv:2603.10359v1 Announce Type: new Abstract: Distilling reasoning capabilities from Large Reasoning Models (LRMs) into smaller models is typically constrained by the limitation of rejection sampling. Standard methods
18. Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability
- 来源:arXiv cs.AI
- 发布时间:2026-03-12 04:00 UTC
- 链接:https://arxiv.org/abs/2603.10384
摘要:arXiv:2603.10384v1 Announce Type: new Abstract: Evaluating LLM reliability via scalar probabilities often fails to capture the structural dynamics of reasoning. We introduce TRACED, a framework that assesses reasoning qu
19. Verbalizing LLM's Higher-order Uncertainty via Imprecise Probabilities
- 来源:arXiv cs.AI
- 发布时间:2026-03-12 04:00 UTC
- 链接:https://arxiv.org/abs/2603.10396
摘要:arXiv:2603.10396v1 Announce Type: new Abstract: Despite the growing demand for eliciting uncertainty from large language models (LLMs), empirical evidence suggests that LLM behavior is not always adequately captured by t
20. Resource-constrained Amazons chess decision framework integrating large language models and graph attention
- 来源:arXiv cs.AI
- 发布时间:2026-03-12 04:00 UTC
- 链接:https://arxiv.org/abs/2603.10512
摘要:arXiv:2603.10512v1 Announce Type: new Abstract: Artificial intelligence has advanced significantly through the development of intelligent game-playing systems, providing rigorous testbeds for decision-making, strategic p