Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS

StepFun
Enterprise
company
AI & ML interests
None defined yet.
Organization Card
Welcome to StepFun
StepFun, founded in April 2023 with the mission to “Scale-up possibilities for everyone,” unites top talent in artificial intelligence from both domestic and international backgrounds, and is dedicated to advancing toward AGI. The company has already launched the Step series of foundation models, which includes Step-2, a cutting-edge trillion-parameter Mixture of Experts (MoE) language model; Step-1.5V, a powerful multimodal large model; and Step-1V, an innovative image generation model, among others.
Collections: 1
Spaces: 2
Models: 7

stepfun-ai/stepvideo-t2v • Text-to-Video • Updated 2025-02-19 • 1.91k downloads • 416 likes
stepfun-ai/Step-Audio-Tokenizer • Updated 2025-02-18 • 32 likes
stepfun-ai/Step-Audio-Chat • Audio-Text-to-Text • Updated 2025-02-17 • 1.2k downloads • 427 likes
stepfun-ai/Step-Audio-TTS-3B • Text-to-Speech • Updated 2025-02-17 • 2.09k downloads • 168 likes
stepfun-ai/stepvideo-t2v-turbo • Updated 2025-02-17 • 85 likes
stepfun-ai/GOT-OCR2_0 • Image-Text-to-Text • Updated 2025-02-04 • 76.9k downloads • 1.42k likes
stepfun-ai/GOT-OCR-2.0-hf • Image-Text-to-Text • Updated 2025-01-31 • 189k downloads • 171 likes