\n\n@limcheekin\n\t You may find this discussion helpful
https://huggingface.co/microsoft/Phi-4-multimodal-instruct/discussions/7#67c4d764491ec4e926ed9d84
Is speaker diarization possible?
\n","updatedAt":"2025-03-06T18:24:55.120Z","author":{"_id":"668ae510c1c9eaffe41f1069","avatarUrl":"/avatars/cfdd3abd0cce00aedd572c7b82ef57ee.svg","fullname":"Non","name":"Bash82","type":"user","isPro":false,"isHf":false,"isMod":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8537078499794006},"editors":["Bash82"],"editorAvatarUrls":["/avatars/cfdd3abd0cce00aedd572c7b82ef57ee.svg"],"reactions":[],"isReport":false}},{"id":"67d37d9441636d7ec43c8ce2","author":{"_id":"5f3ec133a4dd343b63a632dd","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1597948137099-noauth.jpeg","fullname":"Nguyen Bach","name":"nguyenbh","type":"user","isPro":false,"isHf":false,"isMod":false,"followerCount":31,"isOwner":false,"isOrgMember":true},"createdAt":"2025-03-14T00:51:32.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"@Bash82 the Phi-4-multimodal does not support speaker diarization. ","html":"\n\n@Bash82\n\t the Phi-4-multimodal does not support speaker diarization.
\n","updatedAt":"2025-03-14T00:51:32.465Z","author":{"_id":"5f3ec133a4dd343b63a632dd","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1597948137099-noauth.jpeg","fullname":"Nguyen Bach","name":"nguyenbh","type":"user","isPro":false,"isHf":false,"isMod":false,"followerCount":31}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7568219900131226},"editors":["nguyenbh"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/1597948137099-noauth.jpeg"],"reactions":[],"isReport":false}}],"pinned":false,"locked":false,"collection":"discussions","isPullRequest":false,"isReport":false},"repo":{"name":"microsoft/Phi-4-multimodal-instruct","type":"model"},"activeTab":"discussion","discussionRole":0}">How to use it with LM Studio?
\n\n@limcheekin\n\t You may find this discussion helpful
https://huggingface.co/microsoft/Phi-4-multimodal-instruct/discussions/7#67c4d764491ec4e926ed9d84
Is speaker diarization possible?
\n","updatedAt":"2025-03-06T18:24:55.120Z","author":{"_id":"668ae510c1c9eaffe41f1069","avatarUrl":"/avatars/cfdd3abd0cce00aedd572c7b82ef57ee.svg","fullname":"Non","name":"Bash82","type":"user","isPro":false,"isHf":false,"isMod":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8537078499794006},"editors":["Bash82"],"editorAvatarUrls":["/avatars/cfdd3abd0cce00aedd572c7b82ef57ee.svg"],"reactions":[],"isReport":false}},{"id":"67d37d9441636d7ec43c8ce2","author":{"_id":"5f3ec133a4dd343b63a632dd","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1597948137099-noauth.jpeg","fullname":"Nguyen Bach","name":"nguyenbh","type":"user","isPro":false,"isHf":false,"isMod":false,"followerCount":31,"isOwner":false,"isOrgMember":true},"createdAt":"2025-03-14T00:51:32.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"@Bash82 the Phi-4-multimodal does not support speaker diarization. ","html":"\n\n@Bash82\n\t the Phi-4-multimodal does not support speaker diarization.
\n","updatedAt":"2025-03-14T00:51:32.465Z","author":{"_id":"5f3ec133a4dd343b63a632dd","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1597948137099-noauth.jpeg","fullname":"Nguyen Bach","name":"nguyenbh","type":"user","isPro":false,"isHf":false,"isMod":false,"followerCount":31}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7568219900131226},"editors":["nguyenbh"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/1597948137099-noauth.jpeg"],"reactions":[],"isReport":false}}],"pinned":false,"locked":false,"collection":"discussions","isPullRequest":false,"isReport":false},"primaryEmailConfirmed":false,"repo":{"name":"microsoft/Phi-4-multimodal-instruct","type":"model"},"discussionRole":0,"acceptLanguages":["en","*"],"disableDiscussionClosingAndCommentHiding":false,"hideComments":true}">Can't find it in LM Studio download
Someone has to convert this to GGUF, it shouldn't take long.
After having a gguf version will it be able to do audio tasks, like ASR, speaker diarization, speech synthesis, as well?
Someone has to convert this to GGUF, it shouldn't take long.
I guess GGUF not supported, otherwise we should see it by now.
Perhaps we can count on the ONNX version for CPU only environment for now.
Not sure LM Studio support it though.
https://huggingface.co/microsoft/Phi-4-multimodal-instruct-onnx
@limcheekin
You may find this discussion helpful
https://huggingface.co/microsoft/Phi-4-multimodal-instruct/discussions/7#67c4d764491ec4e926ed9d84
Is speaker diarization possible?