[{"data":1,"prerenderedAt":205},["ShallowReactive",2],{"DlFXI4Eibt_Bn9lrEZz1TYbHCWFZj3IvqwHQSEW-Exc":3,"9oOYAgQaUWH9NdV4z9JdoInFZ9M51dj3LwUTdW1FdbU":194},{"code":4,"msg":5,"data":6},0,"",{"category":7,"tag":11,"hot":39,"new":78,"banner":118,"data":143,"cache":193},[8,9,10],"Agent","OpenAI","LLM",[12,14,17,20,23,25,27,30,33,36],{"title":8,"total":13},39,{"title":15,"total":16},"Google",44,{"title":18,"total":19},"Nvidia",13,{"title":21,"total":22},"Claude",11,{"title":9,"total":24},35,{"title":10,"total":26},85,{"title":28,"total":29},"DeepSeek",9,{"title":31,"total":32},"OCR",1,{"title":34,"total":35},"Chat",7,{"title":37,"total":38},"Generator",116,[40,48,55,64,71],{"id":41,"publish_date":42,"is_original":4,"collection":5,"cover_url":43,"cover_url_1_1":44,"title":45,"summary":46,"author":47},557,"2022-04-29","article_res/cover/7a9b1375ed9bb298154981bae42b794d.jpeg","article_res/cover/afa281dd52bc0454e6735daa8e6b0706.jpeg","Translation and summary of Messari Report [2.8 Kristin Smith, Blockchain Association and Katie Haun, a16z]","We need unity and speed right now.","Translation",{"id":49,"publish_date":50,"is_original":4,"collection":5,"cover_url":51,"cover_url_1_1":52,"title":53,"summary":54,"author":47},531,"2022-05-25","article_res/cover/e8362057f8fa189594c60afdfaaeb6e5.jpeg","article_res/cover/8ea08d0d6fa7eee6b57ed4ec61b61ad6.jpeg","Decentralized Society: Finding Web3’s Soul / Decentralized Society: Finding the Soul of Web3 -7","Decentralization through Pluralism When analyzing ecosystems, it's desirable to measure how decentralized it is.",{"id":56,"publish_date":57,"is_original":32,"collection":58,"cover_url":59,"cover_url_1_1":60,"title":61,"summary":62,"author":63},127,"2024-11-14","#Google #AI Game #World Model #AI Story","article_res/cover/0233a875b7ec2debf59779e311547569.jpeg","article_res/cover/6ffddb6ae4914b3c699493311aa9f198.jpeg","Google Launches \"Unbounded\": A Generative Infinite Character Life Simulation Game","Unbounded: A Generative Infinite Game of Character Life Simulation","Renee's Entrepreneurial Journey",{"id":13,"publish_date":65,"is_original":32,"collection":66,"cover_url":67,"cover_url_1_1":68,"title":69,"summary":70,"author":63},"2025-02-14","#Deep Dive into LLMs #Andrej Karpathy #LLM #Tool Use #Hallucination","article_res/cover/11e858ad6b74dfa80f923d549b62855c.jpeg","article_res/cover/615e1b320f1fc163edc1d2d154a6de33.jpeg","Andrej Karpathy's in-depth explanation of LLM (Part 4): Hallucinations","hallucinations, tool use, knowledge/working memory",{"id":72,"publish_date":73,"is_original":4,"collection":5,"cover_url":74,"cover_url_1_1":75,"title":76,"summary":77,"author":47},579,"2022-04-07","article_res/cover/39387376ba28447af1eb40576b9df215.jpeg","article_res/cover/02727ede8551ed49901d0abe6d6305b7.jpeg","Messari Report Translation and Summary 【1-7 Surviving the Winter】","I’d be more cautious here: 10 year and 10 hour thinking only.",[79,87,95,103,111],{"id":80,"publish_date":81,"is_original":32,"collection":82,"cover_url":83,"cover_url_1_1":84,"title":85,"summary":86,"author":63},627,"2025-03-20","#AI Avatar #AI Video Generation","article_res/cover/d95481358f73924989f8c4ee9c75d1c8.jpeg","article_res/cover/b74bc0fab01f8b6a6aa87696c0c3ed8b.jpeg","DisPose: Generating Animated Videos by Driving Video with Reference Images","DisPose is a controllable human image animation method that enhances video generation.",{"id":88,"publish_date":89,"is_original":32,"collection":90,"cover_url":91,"cover_url_1_1":92,"title":93,"summary":94,"author":63},626,"2025-03-21","#Deep Dive into LLMs #LLM #RL #Andrej Karpathy #AlphaGo","article_res/cover/446553a5c8f8f2f07d97b20eaee84e56.jpeg","article_res/cover/e6c2823409c9b34624064b9acbaca6f1.jpeg","AlphaGo and the Power of Reinforcement Learning - Andrej Karpathy's Deep Dive on LLMs (Part 9)","Simply learning from humans will never surpass human capabilities.",{"id":96,"publish_date":97,"is_original":32,"collection":98,"cover_url":99,"cover_url_1_1":100,"title":101,"summary":102,"author":63},625,"2025-03-22","#Deep Dive into LLMs #LLM #RL #RLHF #Andrej Karpathy","article_res/cover/8da81d38b1e5cf558a164710fd8a5389.jpeg","article_res/cover/96f028d76c362a99a0dd56389e8f7a9b.jpeg","Reinforcement Learning from Human Feedback (RLHF) - Andrej Karpathy's Deep Dive on LLMs (Part 10)","Fine-Tuning Language Models from Human Preferences",{"id":104,"publish_date":105,"is_original":32,"collection":106,"cover_url":107,"cover_url_1_1":108,"title":109,"summary":110,"author":63},624,"2025-03-23","#Deep Dive into LLMs #LLM #Andrej Karpathy #AI Agent #MMM","article_res/cover/a5e7c3d48bb09109684d6513287c661d.jpeg","article_res/cover/d3f22b7c0ab8d82fd2da457a299e0773.jpeg","The Future of Large Language Models - Andrej Karpathy's In-Depth Explanation of LLM (Part 11)","preview of things to come",{"id":112,"publish_date":105,"is_original":32,"collection":113,"cover_url":114,"cover_url_1_1":115,"title":116,"summary":117,"author":63},623,"#Google #Voe #AI Video Generation","article_res/cover/c44062fea0f336c2b96b3928292392c2.jpeg","article_res/cover/a041041c69092ad3db191c5bf3ff981b.jpeg","Trial of Google's video generation model VOE2","Our state-of-the-art video generation model",[119,127,135],{"id":120,"publish_date":121,"is_original":32,"collection":122,"cover_url":123,"cover_url_1_1":124,"title":125,"summary":126,"author":63},160,"2024-10-04","#Philosophy","article_res/cover/496990c49211e8b7f996b7d39c18168e.jpeg","article_res/cover/14dbaa1ade9cb4316d5829423a900362.jpeg","Time","The fungus of the morning does not know the waxing and waning of the moon, and the cicada does not know the seasons; this is a short life. To the south of the state of Chu there is a dark spirit which regards five hundred years as spring and five hundred years as autumn. In ancient times there was a great tree called the Ming which regarded eight thousand years as spring and eight thousand years as autumn; this is a long life.",{"id":128,"publish_date":129,"is_original":32,"collection":130,"cover_url":131,"cover_url_1_1":132,"title":133,"summary":134,"author":63},98,"2024-12-17","#AI Video Generator #Sora #Pika","article_res/cover/3b86e85d03fff4f356a3e4cf2bb329c9.jpeg","article_res/cover/5fa5c20ad0b40f8f544d257c0ef02938.jpeg","Pika 2.0 video generation officially released: effect comparison with Sora","今天，我们推出了Pika 2.0模型。卓越的文字对齐效果。惊人的视觉表现。还有✨场景成分✨",{"id":136,"publish_date":137,"is_original":32,"collection":138,"cover_url":139,"cover_url_1_1":140,"title":141,"summary":142,"author":63},71,"2025-01-14","#Nvidia #World Foundation Model #Cosmos #Physical AI #Embodied AI","article_res/cover/feddf8c832dfb45d28804291f6a42a9e.jpeg","article_res/cover/d6bc2f1186d96b78228c2283a17a3645.jpeg","NVIDIA's Cosmos World Model","Cosmos World Foundation Model Platform for Physical AI",[144,163,188],{"title":8,"items":145},[146,147,155],{"id":104,"publish_date":105,"is_original":32,"collection":106,"cover_url":107,"cover_url_1_1":108,"title":109,"summary":110,"author":63},{"id":148,"publish_date":149,"is_original":32,"collection":150,"cover_url":151,"cover_url_1_1":152,"title":153,"summary":154,"author":63},622,"2025-03-24","#OWL #AI Agent #MAS #MCP #CUA","article_res/cover/cb50ca7f2bf4d1ed50202d7406e1c19a.jpeg","article_res/cover/4aa7aa3badfacf3cc84121334f1050dd.jpeg","OWL: Multi-agent collaboration","OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation",{"id":156,"publish_date":157,"is_original":32,"collection":158,"cover_url":159,"cover_url_1_1":160,"title":161,"summary":162,"author":63},620,"2025-03-26","#LLM #Google #Gemini #AI Agent","article_res/cover/53751a6dbbe990b1eb0b63f3b062aed4.jpeg","article_res/cover/031344981f0a212ff82d1f3a64aa5756.jpeg","Gemini 2.5 Pro, claimed to be far ahead of the competition, has been released with great fanfare: comprehensively surpassing other LLMs and topping the global rankings","Gemini 2.5: Our most intelligent AI model",{"title":9,"items":164},[165,172,180],{"id":166,"publish_date":157,"is_original":32,"collection":167,"cover_url":168,"cover_url_1_1":169,"title":170,"summary":171,"author":63},619,"#OpenAI #AI Image Generator #4o #MMM #AR Transformer","article_res/cover/2faffc97fcecf3151552cb0fd3206d89.jpeg","article_res/cover/1133cb4948af44cee2e7fbe79efb69e5.jpeg","The native image function of GPT-4o is officially launched","Introducing 4o Image Generation",{"id":173,"publish_date":174,"is_original":4,"collection":175,"cover_url":176,"cover_url_1_1":177,"title":178,"summary":179,"author":63},434,"2023-07-15","#Anthropic #OpenAI #Google #AI Code Generator #Claude","article_res/cover/e1b6f600a2b9f262a4392684e5f2ce25.jpeg","article_res/cover/6e1772e83f78f9a351ab23d3e414adee.jpeg","Latest Updates on Google Bard /Anthropic Claude2 / ChatGPT Code Interpreter","We want our models to use their programming skills to provide more natural interfaces to the basic functions of our computers.  \n - OpenAI",{"id":181,"publish_date":182,"is_original":4,"collection":183,"cover_url":184,"cover_url_1_1":185,"title":186,"summary":187,"author":63},417,"2023-08-24","#OpenAI","article_res/cover/bccf897d50a88b18364e35f7466387e0.jpeg","article_res/cover/2f871085c1073717c1703ae86e18056f.jpeg","The GPT-3.5 Turbo fine-tuning (fine-tuning function) has been released～","Developers can now bring their own data to customize GPT-3.5 Turbo for their use cases.",{"title":10,"items":189},[190,191,192],{"id":88,"publish_date":89,"is_original":32,"collection":90,"cover_url":91,"cover_url_1_1":92,"title":93,"summary":94,"author":63},{"id":96,"publish_date":97,"is_original":32,"collection":98,"cover_url":99,"cover_url_1_1":100,"title":101,"summary":102,"author":63},{"id":104,"publish_date":105,"is_original":32,"collection":106,"cover_url":107,"cover_url_1_1":108,"title":109,"summary":110,"author":63},true,{"code":4,"msg":5,"data":195},{"id":196,"publish_date":197,"is_original":32,"collection":198,"articles_id":199,"cover_url":200,"cover_url_1_1":201,"title":202,"summary":203,"author":63,"content":204},316,"2024-03-12","#Anthropic #Claude","pNtGX3npFAvvyqt-ES0OTw","article_res/cover/a618b54183a3af0c342a90a2c148aba3.jpeg","article_res/cover/2ea4501cee5874c4c00b1828a079dafb.jpeg","Introduction to Claude 3 (Part 1)","Being at the frontier of AI development is the most effective way to steer trajectory towards positive societal outcomes.","\u003Cdiv class=\"rich_media_content js_underline_content\n                       autoTypeSetting24psection\n            \" id=\"js_content\">\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>最近一直在出差和开会，没能及时跟进最新的AI动态。今天，试用了已经火了一段时间的Claude 3。\u003C/p>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>Claude 3在2024年3月4日发布的。Claude 3包含三个模型：Claude 3 Haiku、Claude 3 Sonnet和Claude 3 Opus，能力递增。\u003C/p>\u003Cp style=\"text-align: center;\">\u003Cimg class=\"rich_pages wxw-img js_insertlocalimg\" data-imgfileid=\"100003822\" data-ratio=\"0.5333333333333333\" data-s=\"300,640\" data-type=\"webp\" data-w=\"1080\" style=\"\" src=\"https://res.cooltool.vip/article_res/assets/17423810903140.5360522770146201.jpeg\">\u003C/p>\u003Csection data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;color: rgb(0, 0, 0);font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;overflow-x: auto;'>\u003Ctable>\u003Cthead>\u003Ctr style=\"border-width: 1px 0px 0px;border-right-style: initial;border-bottom-style: initial;border-left-style: initial;border-right-color: initial;border-bottom-color: initial;border-left-color: initial;border-top-style: solid;border-top-color: rgb(204, 204, 204);background-color: white;\">\u003Cth style=\"border-top-width: 1px;border-color: rgb(204, 204, 204);background-color: rgb(240, 240, 240);text-align: left;min-width: 85px;\">功能/模型\u003C/th>\u003Cth style=\"border-top-width: 1px;border-color: rgb(204, 204, 204);background-color: rgb(240, 240, 240);text-align: left;min-width: 85px;\">Opus\u003C/th>\u003Cth style=\"border-top-width: 1px;border-color: rgb(204, 204, 204);background-color: rgb(240, 240, 240);text-align: left;min-width: 85px;\">Sonnet\u003C/th>\u003Cth style=\"border-top-width: 1px;border-color: rgb(204, 204, 204);background-color: rgb(240, 240, 240);text-align: left;min-width: 85px;\">Haiku\u003C/th>\u003C/tr>\u003C/thead>\u003Ctbody style=\"border-width: 0px;border-style: initial;border-color: initial;\">\u003Ctr style=\"border-width: 1px 0px 0px;border-right-style: initial;border-bottom-style: initial;border-left-style: initial;border-right-color: initial;border-bottom-color: initial;border-left-color: initial;border-top-style: solid;border-top-color: rgb(204, 204, 204);background-color: white;\">\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">描述\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">最智能的模型，在高度复杂任务上的性能是市场上最好的\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">实现了智能与速度之间的理想平衡，特别是对于企业工作负载\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">最快速、最紧凑的模型，能够提供近乎即时的响应性\u003C/td>\u003C/tr>\u003Ctr style=\"border-width: 1px 0px 0px;border-right-style: initial;border-bottom-style: initial;border-left-style: initial;border-right-color: initial;border-bottom-color: initial;border-left-color: initial;border-top-style: solid;border-top-color: rgb(204, 204, 204);background-color: rgb(248, 248, 248);\">\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">输入成本价格\u003Cbr>/million tokens\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">$15\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">$3\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">$0.25\u003C/td>\u003C/tr>\u003Ctr style=\"border-width: 1px 0px 0px;border-right-style: initial;border-bottom-style: initial;border-left-style: initial;border-right-color: initial;border-bottom-color: initial;border-left-color: initial;border-top-style: solid;border-top-color: rgb(204, 204, 204);background-color: white;\">\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">输出成本价格\u003Cbr>/million tokens\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">$75\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">$15\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">$1.25\u003C/td>\u003C/tr>\u003Ctr style=\"border-width: 1px 0px 0px;border-right-style: initial;border-bottom-style: initial;border-left-style: initial;border-right-color: initial;border-bottom-color: initial;border-left-color: initial;border-top-style: solid;border-top-color: rgb(204, 204, 204);background-color: rgb(248, 248, 248);\">\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">上下文窗口\u003Cbr>tokens\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">200K，同时针对特定用例提供1M\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">200K\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">200K\u003C/td>\u003C/tr>\u003Ctr style=\"border-width: 1px 0px 0px;border-right-style: initial;border-bottom-style: initial;border-left-style: initial;border-right-color: initial;border-bottom-color: initial;border-left-color: initial;border-top-style: solid;border-top-color: rgb(204, 204, 204);background-color: white;\">\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">潜在应用\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">任务自动化、研发、策略\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">数据处理、销售、节省时间的任务\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">客户互动、内容审核、节省成本的任务\u003C/td>\u003C/tr>\u003Ctr style=\"border-width: 1px 0px 0px;border-right-style: initial;border-bottom-style: initial;border-left-style: initial;border-right-color: initial;border-bottom-color: initial;border-left-color: initial;border-top-style: solid;border-top-color: rgb(204, 204, 204);background-color: rgb(248, 248, 248);\">\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">特点\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">提供市场上其他所有模型都无法比拟的高智能\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">相比其他类似智能的模型更经济，更适合大规模应用\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">在其智能类别中更聪明、更快速、更经济\u003C/td>\u003C/tr>\u003Ctr style=\"border-width: 1px 0px 0px;border-right-style: initial;border-bottom-style: initial;border-left-style: initial;border-right-color: initial;border-bottom-color: initial;border-left-color: initial;border-top-style: solid;border-top-color: rgb(204, 204, 204);background-color: white;\">\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">前端页面使用\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">claude.ai Pro 订阅者\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">claude.ai 免费用户\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">马上上线\u003C/td>\u003C/tr>\u003Ctr style=\"border-width: 1px 0px 0px;border-right-style: initial;border-bottom-style: initial;border-left-style: initial;border-right-color: initial;border-bottom-color: initial;border-left-color: initial;border-top-style: solid;border-top-color: rgb(204, 204, 204);background-color: rgb(248, 248, 248);\">\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">API 调用\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">支持\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">支持\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">马上上线\u003C/td>\u003C/tr>\u003Ctr style=\"border-width: 1px 0px 0px;border-right-style: initial;border-bottom-style: initial;border-left-style: initial;border-right-color: initial;border-bottom-color: initial;border-left-color: initial;border-top-style: solid;border-top-color: rgb(204, 204, 204);background-color: white;\">\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">第三方云平台支持\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">AWS Bedrock/Google Vertex AI Model Garden\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">马上上线\u003C/td>\u003Ctd style=\"border-color: rgb(204, 204, 204);min-width: 85px;\">马上上线\u003C/td>\u003C/tr>\u003C/tbody>\u003C/table>\u003C/section>\u003Ch2 data-tool=\"mdnice编辑器\" style='margin-top: 30px;margin-bottom: 15px;font-weight: bold;font-size: 22px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;letter-spacing: normal;text-align: left;text-wrap: wrap;'>多个能力基准测试比较\u003C/h2>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>评估包括本科级专家知识（MMLU）、研究生级专家推理（GPQA）、基础数学（GSM8K）等。\u003C/p>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>Claude 3 Opus在复杂任务上展现出接近人类的理解和流畅度。所有Claude 3模型在分析和预测、细腻的内容创作、代码生成，以及使用西班牙语、日语和法语等非英语语言进行交流方面的能力都有所增强。\u003C/p>\u003Cp style=\"text-align: center;\">\u003Cimg class=\"rich_pages wxw-img js_insertlocalimg\" data-imgfileid=\"100003823\" data-ratio=\"0.887962962962963\" data-s=\"300,640\" data-type=\"webp\" data-w=\"1080\" style=\"\" src=\"https://res.cooltool.vip/article_res/assets/17423810903230.21422936170824847.jpeg\">\u003C/p>\u003Ch2 data-tool=\"mdnice编辑器\" style='margin-top: 30px;margin-bottom: 15px;font-weight: bold;font-size: 22px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;letter-spacing: normal;text-align: left;text-wrap: wrap;'>推理速度\u003C/h2>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>Claude 3模型能够支持实时的客户聊天、自动补全和数据提取任务，这些任务需要即时和实时的响应。\u003C/p>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>Haiku是市场上同类智能模型中最快速和最具成本效益的。它能在不到三秒的时间内阅读一个信息和数据密集的arXiv研究论文（约10000个Token）及其图表和图形。随着产品的推出，预计性能将进一步提升。\u003C/p>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>对于大多数工作负载而言，Sonnet的速度是Claude 2和Claude 2.1的两倍，而且智能水平更高。擅长需要快速响应的任务，如知识检索或销售自动化。\u003C/p>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>Opus的速度与Claude 2和2.1相似，但智能水平要高得多。\u003C/p>\u003Ch2 data-tool=\"mdnice编辑器\" style='margin-top: 30px;margin-bottom: 15px;font-weight: bold;font-size: 22px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;letter-spacing: normal;text-align: left;text-wrap: wrap;'>视觉能力比较\u003C/h2>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>Claude 3模型具有与其他领先模型相当的复杂视觉能力。它们能够处理各种视觉格式，包括照片、图表、图形和技术图解。可以理解各种格式编码，如PDF、流程图或演示幻灯片的知识库。\u003C/p>\u003Cp style=\"text-align: center;\">\u003Cimg class=\"rich_pages wxw-img js_insertlocalimg\" data-imgfileid=\"100003824\" data-ratio=\"0.4361111111111111\" data-s=\"300,640\" data-type=\"webp\" data-w=\"1080\" style=\"\" src=\"https://res.cooltool.vip/article_res/assets/17423810903170.2731703159440677.jpeg\">\u003C/p>\u003Ch2 data-tool=\"mdnice编辑器\" style='margin-top: 30px;margin-bottom: 15px;font-weight: bold;font-size: 22px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;letter-spacing: normal;text-align: left;text-wrap: wrap;'>肉眼可见的聪明\u003C/h2>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>以往的Claude模型经常做出不必要的拒绝回应，这暗示了对上下文的理解不足。相比下，Opus、Sonnet和Haiku在接近系统安全边界的提示上拒绝回答的可能性大大降低。如下所示，Claude 3模型对请求展现了更加细腻的理解，能够识别真正的危害，并且在面对无害的提示时较少拒绝回答。\u003C/p>\u003Cp style=\"text-align: center;\">\u003Cimg class=\"rich_pages wxw-img js_insertlocalimg\" data-imgfileid=\"100003825\" data-ratio=\"0.41944444444444445\" data-s=\"300,640\" data-type=\"webp\" data-w=\"1080\" style=\"\" src=\"https://res.cooltool.vip/article_res/assets/17423810903190.2368689703140643.jpeg\">\u003C/p>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>将回答分为正确答案、错误答案（或幻觉）和不确定性承认，其中，不确定性承认是指模型声明它不知道答案，而不是提供错误信息。与Claude 2.1相比，Opus在这些具有挑战性的开放式问题上展现了两倍的准确性改进（或正确答案），同时还显示出减少的错误答案水平。除了产生更可信的回应之外，很快还将在Claude 3模型中启用引用功能，使它们能够指向参考材料中的确切句子来验证其答案。\u003C/p>\u003Cp style=\"text-align: center;\">\u003Cimg class=\"rich_pages wxw-img js_insertlocalimg\" data-imgfileid=\"100003826\" data-ratio=\"0.4074074074074074\" data-s=\"300,640\" data-type=\"webp\" data-w=\"1080\" style=\"\" src=\"https://res.cooltool.vip/article_res/assets/17423810903200.16119913013500686.jpeg\">\u003C/p>\u003Ch2 data-tool=\"mdnice编辑器\" style='margin-top: 30px;margin-bottom: 15px;font-weight: bold;font-size: 22px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;letter-spacing: normal;text-align: left;text-wrap: wrap;'>上下文窗口\u003C/h2>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>Claude 3系列模型在初次发布时提供20万个上下文窗口。然而，所有三个模型都能够接受超过100万个token的输入，可能会向需要增强处理能力的特定客户提供这一功能。\u003C/p>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>为了有效处理长上下文提示，模型需要强大的回忆能力。Claude 3采用了大海捞针'（NIAH）来进行评估。NIAH，或“大海捞针”（Needle In A Haystack），是一个评估模型的能力，特别是在从大量数据中准确提取特定信息的能力。在人工智能和机器学习领域，这种评估通常用于测试模型能否有效地从庞大、复杂的数据集中检索出极为细微或特定的信息片段。\u003C/p>\u003Cp style=\"text-align: center;\">\u003Cimg class=\"rich_pages wxw-img js_insertlocalimg\" data-imgfileid=\"100003827\" data-ratio=\"0.49444444444444446\" data-s=\"300,640\" data-type=\"webp\" data-w=\"1080\" style=\"\" src=\"https://res.cooltool.vip/article_res/assets/17423810904380.7072355349637891.jpeg\">\u003C/p>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>通过使用每个提示中的30个随机针/问题对之一，并在一个多样化的众包文档语料库上测试，增强了这一基准测试的稳健性。Claude 3 Opus不仅实现了近乎完美的回忆，准确率超过了99%，而且在某些情况下，它甚至识别了评估本身的局限性，认识到“针”句似乎是人为插入到原始文本中的。\u003C/p>\u003Ch2 data-tool=\"mdnice编辑器\" style='margin-top: 30px;margin-bottom: 15px;font-weight: bold;font-size: 22px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;letter-spacing: normal;text-align: left;text-wrap: wrap;'>安全和隐私\u003C/h2>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>设有多个专门团队来追踪和减轻广泛的风险，包括错误信息、儿童色情材料（CSAM）、生物误用、选举干预和自主复制技能。团队还在继续开发如宪法式AI等方法，以提高模型的安全性和透明度，并已调整我们的模型以减轻新模态可能引发的隐私问题。如模型卡片所示，根据问题回答偏见基准测试（BBQ），Claude 3模型显示的偏见少于之前的模型。团队致力于推进减少偏见和促进模型更大中立性的技术，确保它们不偏向任何特定的政治立场。\u003C/p>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>虽然Claude 3模型家族在生物学知识、网络相关知识和自主性方面相比以往模型有所进步，但根据负责任扩展政策，它仍处于AI安全级别2（ASL-2）。红队评估（按照团队对白宫的承诺和2023年美国行政命令进行）已得出结论，模型目前的灾难性风险潜力可以忽略不计。我们将继续仔细监控未来的模型，评估它们与ASL-3阈值的接近程度。\u003C/p>\u003Ch2 data-tool=\"mdnice编辑器\" style='margin-top: 30px;margin-bottom: 15px;font-weight: bold;font-size: 22px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;letter-spacing: normal;text-align: left;text-wrap: wrap;'>易于使用\u003C/h2>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>Claude 3模型更擅长遵循复杂的多步骤指令。特别擅长坚持品牌声音和响应指南，此外，Claude 3模型在生成像JSON这样流行的结构化输出方面表现更佳——使得指导Claude用于自然语言分类和情感分析等用例变得更简单。\u003C/p>\u003Ch2 data-tool=\"mdnice编辑器\" style='margin-top: 30px;margin-bottom: 15px;font-weight: bold;font-size: 22px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;letter-spacing: normal;text-align: left;text-wrap: wrap;'>未来展望\u003C/h2>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>Anthropic 团队认为模型智能远未触及其极限，并计划在接下来的几个月中频繁更新Claude 3模型家族。还计划发布一系列功能以增强模型的能力，特别是针对企业用例和大规模部署。这些新功能将包括工具使用（function calling）、交互式编程（REPL）和更高级的代理能力（agent）。\u003C/p>\u003Cp data-tool=\"mdnice编辑器\" style='margin-bottom: 0px;padding-top: 8px;padding-bottom: 8px;color: black;font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, \"PingFang SC\", Cambria, Cochin, Georgia, Times, \"Times New Roman\", serif;font-size: 16px;letter-spacing: normal;text-align: left;text-wrap: wrap;line-height: 26px;'>明天会分享一些Claude 3的实用案例分析。\u003C/p>\u003Cp>\u003Cbr>\u003C/p>\u003Cp style=\"display: none;\">\u003Cmp-style-type data-value=\"3\">\u003C/mp-style-type>\u003C/p>\u003C/div>",1752585426039]