{"id":7622,"date":"2024-10-24T00:56:57","date_gmt":"2024-10-23T16:56:57","guid":{"rendered":"https:\/\/www.aisharenet.com\/?p=7622"},"modified":"2025-02-10T14:55:54","modified_gmt":"2025-02-10T06:55:54","slug":"cosyvoice","status":"publish","type":"post","link":"https:\/\/www.kdjingpai.com\/de\/cosyvoice\/","title":{"rendered":"CosyVoice\uff1a\u963f\u91cc\u63a8\u51fa\u76843\u79d2\u6025\u901f\u8bed\u97f3\u514b\u9686\u5f00\u6e90\u9879\u76ee\uff0c\u652f\u6301\u60c5\u611f\u63a7\u5236\u6807\u7b7e"},"content":{"rendered":"<p>CosyVoice\u662f\u4e00\u4e2a\u591a\u8bed\u8a00\u5927\u89c4\u6a21\u8bed\u97f3\u751f\u6210\u6a21\u578b\uff0c\u63d0\u4f9b\u4ece\u63a8\u7406\u3001\u8bad\u7ec3\u5230\u90e8\u7f72\u7684\u5168\u6808\u80fd\u529b\u3002\u8be5\u9879\u76ee\u7531FunAudioLLM\u56e2\u961f\u5f00\u53d1\uff0c\u65e8\u5728\u901a\u8fc7\u5148\u8fdb\u7684\u81ea\u56de\u5f52\u53d8\u6362\u5668\u548c\u57fa\u4e8eODE\u7684\u6269\u6563\u6a21\u578b\uff0c\u5b9e\u73b0\u9ad8\u8d28\u91cf\u7684\u8bed\u97f3\u5408\u6210\u3002CosyVoice\u4e0d\u4ec5\u652f\u6301\u591a\u8bed\u8a00\u8bed\u97f3\u751f\u6210\uff0c\u8fd8\u80fd\u8fdb\u884c\u60c5\u611f\u63a7\u5236\u548c\u7ca4\u8bed\u5408\u6210\uff0c\u8fbe\u5230\u4e0e\u4eba\u7c7b\u53d1\u97f3\u76f8\u5f53\u7684\u6c34\u5e73\u3002<\/p>\n<p>\u514d\u8d39\u5728\u7ebf\u4f53\u9a8c\uff08\u6587\u672c\u8f6c\u8bed\u97f3\uff09\uff1ahttps:\/\/modelscope.cn\/studios\/iic\/CosyVoice-300M<\/p>\n<p>\u514d\u8d39\u5728\u7ebf\u4f53\u9a8c\uff08\u8bed\u97f3\u8f6c\u6587\u672c\uff09\uff1ahttps:\/\/www.modelscope.cn\/studios\/iic\/SenseVoice<\/p>\n<p><img decoding=\"async\" title=\"CosyVoice\uff1a\u963f\u91cc\u63a8\u51fa\u76843\u79d2\u6025\u901f\u8bed\u97f3\u514b\u9686\uff0c\u652f\u6301\u60c5\u611f\u63a7\u5236\u6807\u7b7e-1\" src=\"https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/055b99082b1b94f.png\" alt=\"CosyVoice\uff1a\u963f\u91cc\u63a8\u51fa\u76843\u79d2\u6025\u901f\u8bed\u97f3\u514b\u9686\uff0c\u652f\u6301\u60c5\u611f\u63a7\u5236\u6807\u7b7e-1\" \/><\/p>\n<p>&nbsp;<\/p>\n<h2>\u529f\u80fd\u5217\u8868<\/h2>\n<ul>\n<li>\u591a\u8bed\u8a00\u8bed\u97f3\u751f\u6210\uff1a\u652f\u6301\u591a\u79cd\u8bed\u8a00\u7684\u8bed\u97f3\u5408\u6210\u3002<\/li>\n<li>\u8bed\u97f3\u514b\u9686\uff1a\u80fd\u591f\u514b\u9686\u7279\u5b9a\u8bf4\u8bdd\u4eba\u7684\u8bed\u97f3\u7279\u5f81\u3002<\/li>\n<li>\u6587\u672c\u8f6c\u8bed\u97f3\uff1a\u5c06\u6587\u672c\u5185\u5bb9\u8f6c\u6362\u4e3a\u81ea\u7136\u6d41\u7545\u7684\u8bed\u97f3\u3002<\/li>\n<li>\u60c5\u611f\u63a7\u5236\uff1a\u5408\u6210\u8bed\u97f3\u65f6\u53ef\u8c03\u8282\u60c5\u611f\u8868\u8fbe\u3002<\/li>\n<li>\u7ca4\u8bed\u5408\u6210\uff1a\u652f\u6301\u7ca4\u8bed\u7684\u8bed\u97f3\u751f\u6210\u3002<\/li>\n<li>\u9ad8\u8d28\u91cf\u97f3\u9891\u8f93\u51fa\uff1a\u901a\u8fc7HiFTNet\u58f0\u7801\u5668\u5408\u6210\u9ad8\u4fdd\u771f\u97f3\u9891\u3002<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h2>\u4f7f\u7528\u5e2e\u52a9<\/h2>\n<h3>\u5b89\u88c5\u6d41\u7a0b<\/h3>\n<p>\u8fd1\u65e5\uff0c\u963f\u91cc\u901a\u4e49\u5b9e\u9a8c\u5ba4\u5f00\u6e90\u4e86CosyVoice\u8bed\u97f3\u6a21\u578b\uff0c\u5b83\u652f\u6301\u81ea\u7136\u8bed\u97f3\u751f\u6210\uff0c\u652f\u6301\u591a\u8bed\u8a00\u3001\u97f3\u8272\u548c\u60c5\u611f\u63a7\u5236\uff0c\u5728\u591a\u8bed\u8a00\u8bed\u97f3\u751f\u6210\u3001\u96f6\u6837\u672c\u8bed\u97f3\u751f\u6210\u3001\u8de8\u8bed\u8a00\u58f0\u97f3\u5408\u6210\u548c\u6307\u4ee4\u6267\u884c\u80fd\u529b\u65b9\u9762\u8868\u73b0\u5353\u8d8a\u3002<\/p>\n<p>CosyVoice\u91c7\u7528\u4e86\u603b\u5171\u8d8515\u4e07\u5c0f\u65f6\u7684\u6570\u636e\u8bad\u7ec3\uff0c\u652f\u6301\u4e2d\u82f1\u65e5\u7ca4\u97e95\u79cd\u8bed\u8a00\u7684\u5408\u6210\uff0c\u5408\u6210\u6548\u679c\u663e\u8457\u4f18\u4e8e\u4f20\u7edf\u8bed\u97f3\u5408\u6210\u6a21\u578b\u3002<\/p>\n<p>CosyVoice\u652f\u6301one-shot\u97f3\u8272\u514b\u9686 \uff1a\u4ec5\u9700\u89813~10s\u7684\u539f\u59cb\u97f3\u9891\uff0c\u5373\u53ef\u751f\u6210\u6a21\u62df\u97f3\u8272\uff0c\u751a\u81f3\u5305\u62ec\u97f5\u5f8b\u3001\u60c5\u611f\u7b49\u7ec6\u8282\u3002\u5728\u8de8\u8bed\u79cd\u7684\u8bed\u97f3\u5408\u6210\u4e2d\uff0c\u4e5f\u6709\u4e0d\u4fd7\u7684\u8868\u73b0\u3002<\/p>\n<p>\u7531\u4e8e\u5b98\u65b9\u7684\u7248\u672c\u6682\u4e0d\u652f\u6301Windows\u548cMac\u5e73\u53f0\uff0c\u672c\u6b21\u6211\u4eec\u5206\u522b\u5728\u8fd9\u4e24\u4e2a\u5e73\u53f0\u672c\u5730\u90e8\u7f72CosyVoice\u3002<\/p>\n<h4>Windows\u5e73\u53f0<\/h4>\n<p>\u9996\u5148\u6765\u5230windows\u5e73\u53f0\uff0c\u514b\u9686\u9879\u76ee\uff1a<\/p>\n<pre><code>git clone https:\/\/github.com\/v3ucn\/CosyVoice_For_Windows\r\n<\/code><\/pre>\n<p>\u8fdb\u5165\u9879\u76ee:<\/p>\n<pre><code>cd CosyVoice_For_Windows\r\n<\/code><\/pre>\n<p>\u751f\u6210\u5185\u7f6e\u6a21\u5757\uff1a<\/p>\n<pre><code>git submodule update --init --recursive\r\n<\/code><\/pre>\n<p>\u968f\u540e\u5b89\u88c5\u4f9d\u8d56\uff1a<\/p>\n<pre><code>conda create -n <a href=\"https:\/\/www.kdjingpai.com\/de\/cosyvoice-2\/\">cosyvoice<\/a> python=3.11  \r\nconda activate cosyvoice  \r\npip install -r requirements.txt -i https:\/\/mirrors.aliyun.com\/pypi\/simple\/ --trusted-host=mirrors.aliyun.com\r\n<\/code><\/pre>\n<p>\u5b98\u65b9\u63a8\u8350\u7684Python\u7248\u672c\u662f3.8\uff0c\u5b9e\u9645\u4e0a3.11\u4e5f\u662f\u53ef\u4ee5\u8dd1\u8d77\u6765\u7684\uff0c\u5e76\u4e14\u7406\u8bba\u4e0a3.11\u7684\u6027\u80fd\u66f4\u597d\u3002<\/p>\n<p>\u968f\u540e\u4e0b\u8f7ddeepspeed\u7684windows\u7248\u672c\u5b89\u88c5\u5305\u6765\u8fdb\u884c\u5b89\u88c5\uff1a<\/p>\n<pre><code>https:\/\/github.com\/S95Sedan\/Deepspeed-Windows\/releases\/tag\/v14.0%2Bpy311\r\n<\/code><\/pre>\n<p>\u6700\u540e\uff0c\u5b89\u88c5gpu\u7248\u672c\u7684torch:<\/p>\n<pre><code>pip install torch torchvision torchaudio --index-url https:\/\/download.pytorch.org\/whl\/cu121\r\n<\/code><\/pre>\n<p>\u8fd9\u91cccuda\u7684\u7248\u672c\u9009\u62e912\uff0c\u4e5f\u53ef\u4ee5\u5b89\u88c511\u7684\u3002<\/p>\n<p>\u968f\u540e\u4e0b\u8f7d\u6a21\u578b\uff1a<\/p>\n<pre><code># git\u6a21\u578b\u4e0b\u8f7d\uff0c\u8bf7\u786e\u4fdd\u5df2\u5b89\u88c5git lfs  \r\nmkdir -p pretrained_models  \r\ngit clone https:\/\/www.modelscope.cn\/iic\/CosyVoice-300M.git pretrained_models\/CosyVoice-300M  \r\ngit clone https:\/\/www.modelscope.cn\/iic\/CosyVoice-300M-SFT.git pretrained_models\/CosyVoice-300M-SFT  \r\ngit clone https:\/\/www.modelscope.cn\/iic\/CosyVoice-300M-Instruct.git pretrained_models\/CosyVoice-300M-Instruct  \r\ngit clone https:\/\/www.modelscope.cn\/speech_tts\/speech_kantts_ttsfrd.git pretrained_models\/speech_kantts_ttsfrd\r\n<\/code><\/pre>\n<p>\u7531\u4e8e\u4f7f\u7528\u56fd\u5185\u7684\u9b54\u642d\u4ed3\u5e93\uff0c\u6240\u4ee5\u901f\u5ea6\u975e\u5e38\u5feb<\/p>\n<p>\u6700\u540e\u6dfb\u52a0\u73af\u5883\u53d8\u91cf\uff1a<\/p>\n<pre><code>set PYTHONPATH=third_party\/AcademiCodec;third_party\/Matcha-TTS\r\n<\/code><\/pre>\n<p>\u57fa\u7840\u7528\u6cd5\uff1a<\/p>\n<pre><code>from cosyvoice.cli.cosyvoice import CosyVoice  \r\nfrom cosyvoice.utils.file_utils import load_wav  \r\nimport torchaudio  \r\ncosyvoice = CosyVoice('speech_tts\/CosyVoice-300M-SFT')  \r\n# sft usage  \r\nprint(cosyvoice.list_avaliable_spks())  \r\noutput = cosyvoice.inference_sft('\u4f60\u597d\uff0c\u6211\u662f\u901a\u4e49\u751f\u6210\u5f0f\u8bed\u97f3\u5927\u6a21\u578b\uff0c\u8bf7\u95ee\u6709\u4ec0\u4e48\u53ef\u4ee5\u5e2e\u60a8\u7684\u5417\uff1f', '\u4e2d\u6587\u5973')  \r\ntorchaudio.save('sft.wav', output['tts_speech'], 22050)  \r\ncosyvoice = CosyVoice('speech_tts\/CosyVoice-300M')  \r\n# zero_shot usage  \r\nprompt_speech_16k = load_wav('zero_shot_prompt.wav', 16000)  \r\noutput = cosyvoice.inference_zero_shot('\u6536\u5230\u597d\u53cb\u4ece\u8fdc\u65b9\u5bc4\u6765\u7684\u751f\u65e5\u793c\u7269\uff0c\u90a3\u4efd\u610f\u5916\u7684\u60ca\u559c\u4e0e\u6df1\u6df1\u7684\u795d\u798f\u8ba9\u6211\u5fc3\u4e2d\u5145\u6ee1\u4e86\u751c\u871c\u7684\u5feb\u4e50\uff0c\u7b11\u5bb9\u5982\u82b1\u513f\u822c\u7efd\u653e\u3002', '\u5e0c\u671b\u4f60\u4ee5\u540e\u80fd\u591f\u505a\u7684\u6bd4\u6211\u8fd8\u597d\u5466\u3002', prompt_speech_16k)  \r\ntorchaudio.save('zero_shot.wav', output['tts_speech'], 22050)  \r\n# cross_lingual usage  \r\nprompt_speech_16k = load_wav('cross_lingual_prompt.wav', 16000)  \r\noutput = cosyvoice.inference_cross_lingual('&lt;|en|&gt;And then later on, fully acquiring that company. So keeping management in line, interest in line with the asset that\\'s coming into the family is a reason why sometimes we don\\'t buy the whole thing.', prompt_speech_16k)  \r\ntorchaudio.save('cross_lingual.wav', output['tts_speech'], 22050)  \r\ncosyvoice = CosyVoice('speech_tts\/CosyVoice-300M-Instruct')  \r\n# instruct usage  \r\noutput = cosyvoice.inference_instruct('\u5728\u9762\u5bf9\u6311\u6218\u65f6\uff0c\u4ed6\u5c55\u73b0\u4e86\u975e\u51e1\u7684&lt;strong&gt;\u52c7\u6c14&lt;\/strong&gt;\u4e0e&lt;strong&gt;\u667a\u6167&lt;\/strong&gt;\u3002', '\u4e2d\u6587\u7537', 'Theo \\'Crimson\\', is a fiery, passionate rebel leader. Fights with fervor for justice, but struggles with impulsiveness.')  \r\ntorchaudio.save('instruct.wav', output['tts_speech'], 22050)\r\n<\/code><\/pre>\n<p>\u8fd9\u91cc\u63a8\u8350\u4f7f\u7528webui\uff0c\u66f4\u52a0\u76f4\u89c2\u548c\u65b9\u4fbf\uff1a<\/p>\n<pre><code>python3 webui.py --port 9886 --model_dir .\/pretrained_models\/CosyVoice-300M\r\n<\/code><\/pre>\n<p>\u8bbf\u95ee\u00a0http:\/\/localhost:9886<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-7869\" title=\"CosyVoice\uff1a\u963f\u91cc\u63a8\u51fa\u76843\u79d2\u6025\u901f\u8bed\u97f3\u514b\u9686\u5f00\u6e90\u9879\u76ee\uff0c\u652f\u6301\u60c5\u611f\u63a7\u5236\u6807\u7b7e-1\" src=\"https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/14a4e9f0e4087ff.png\" alt=\"CosyVoice\uff1a\u963f\u91cc\u63a8\u51fa\u76843\u79d2\u6025\u901f\u8bed\u97f3\u514b\u9686\u5f00\u6e90\u9879\u76ee\uff0c\u652f\u6301\u60c5\u611f\u63a7\u5236\u6807\u7b7e-1\" width=\"3633\" height=\"2072\" srcset=\"https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/14a4e9f0e4087ff.png 3633w, https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/14a4e9f0e4087ff-300x171.png 300w, https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/14a4e9f0e4087ff-1024x584.png 1024w, https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/14a4e9f0e4087ff-768x438.png 768w, https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/14a4e9f0e4087ff-1536x876.png 1536w, https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/14a4e9f0e4087ff-2048x1168.png 2048w\" sizes=\"auto, (max-width: 3633px) 100vw, 3633px\" \/><\/p>\n<p>\u9700\u8981\u6ce8\u610f\u7684\u662f\uff0c\u5b98\u65b9\u7684torch\u7684backend\u4f7f\u7528\u7684\u662fsox\uff0c\u8fd9\u91cc\u6539\u6210\u4e86soundfile\uff1a<\/p>\n<pre><code>torchaudio.set_audio_backend('soundfile')\r\n<\/code><\/pre>\n<p>\u53ef\u80fd\u4f1a\u6709\u4e00\u4e9bbug\uff0c\u540e\u7eed\u8fd8\u8bf7\u5173\u6ce8\u5b98\u65b9\u7684\u9879\u76ee\u66f4\u65b0\u3002<\/p>\n<h4>MacOS\u5e73\u53f0<\/h4>\n<p>\u73b0\u5728\u6765\u5230MacOs\u5e73\u53f0\uff0c\u8fd8\u662f\u5148\u514b\u9686\u9879\u76ee\uff1a<\/p>\n<pre><code>git clone https:\/\/github.com\/v3ucn\/CosyVoice_for_MacOs.git\r\n<\/code><\/pre>\n<p>\u5b89\u88c5\u4f9d\u8d56\uff1a<\/p>\n<pre><code>cd CosyVoice_for_MacOs  \r\nconda create -n cosyvoice python=3.8  \r\nconda activate cosyvoice  \r\npip install -r requirements.txt -i https:\/\/mirrors.aliyun.com\/pypi\/simple\/ --trusted-host=mirrors.aliyun.com\r\n<\/code><\/pre>\n<p>\u968f\u540e\u9700\u8981\u901a\u8fc7Homebrew\u5b89\u88c5sox:<\/p>\n<pre><code>brew install sox\r\n<\/code><\/pre>\n<p>\u5982\u6b64\u5c31\u914d\u7f6e\u597d\u4e86\uff0c\u4f46\u662f\u522b\u5fd8\u4e86\u6dfb\u52a0\u73af\u5883\u53d8\u91cf\uff1a<\/p>\n<pre><code>export PYTHONPATH=third_party\/AcademiCodec:third_party\/Matcha-TTS\r\n<\/code><\/pre>\n<p>\u4f7f\u7528\u65b9\u5f0f\u548cWindows\u7248\u672c\u4fdd\u6301\u4e00\u81f4\u3002<\/p>\n<p>\u8fd9\u91cc\u8fd8\u662f\u63a8\u8350\u4f7f\u7528webui:<\/p>\n<pre><code>python3 webui.py --port 50000 --model_dir speech_tts\/CosyVoice-300M\r\n<\/code><\/pre>\n<p>\u8bbf\u95ee\u00a0http:\/\/localhost:50000<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-7870\" title=\"CosyVoice\uff1a\u963f\u91cc\u63a8\u51fa\u76843\u79d2\u6025\u901f\u8bed\u97f3\u514b\u9686\u5f00\u6e90\u9879\u76ee\uff0c\u652f\u6301\u60c5\u611f\u63a7\u5236\u6807\u7b7e-2\" src=\"https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/d5d1a7c1db5bed5.png\" alt=\"CosyVoice\uff1a\u963f\u91cc\u63a8\u51fa\u76843\u79d2\u6025\u901f\u8bed\u97f3\u514b\u9686\u5f00\u6e90\u9879\u76ee\uff0c\u652f\u6301\u60c5\u611f\u63a7\u5236\u6807\u7b7e-2\" width=\"3620\" height=\"1996\" srcset=\"https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/d5d1a7c1db5bed5.png 3620w, https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/d5d1a7c1db5bed5-300x165.png 300w, https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/d5d1a7c1db5bed5-1024x565.png 1024w, https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/d5d1a7c1db5bed5-768x423.png 768w, https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/d5d1a7c1db5bed5-1536x847.png 1536w, https:\/\/www.kdjingpai.com\/wp-content\/uploads\/2024\/10\/d5d1a7c1db5bed5-2048x1129.png 2048w\" sizes=\"auto, (max-width: 3620px) 100vw, 3620px\" \/><\/p>\n<h4>\u7ed3\u8bed<\/h4>\n<p>\u5e73\u5fc3\u800c\u8bba\uff0cCosyVoice\u4e0d\u6127\u662f\u5927\u5382\u51fa\u54c1\uff0c\u6a21\u578b\u7684\u54c1\u8d28\u6ca1\u7684\u8bf4\uff0c\u4ee3\u8868\u4e86\u56fd\u5185AI\u7684\u6700\u9ad8\u6c34\u51c6\uff0c\u901a\u4e49\u5b9e\u9a8c\u5ba4\u540d\u4e0b\u65e0\u865a\uff0c\u5f53\u7136\uff0c\u5982\u679c\u80fd\u5c06\u5de5\u7a0b\u5316\u4e4b\u540e\u7684\u4ee3\u7801\u4e5f\u5f00\u6e90\u51fa\u6765\uff0c\u90a3\u5c31\u66f4\u597d\u4e86\uff0c\u76f8\u4fe1\u7ecf\u8fc7libtorch\u7684\u4f18\u5316\uff0c\u8fd9\u4e2a\u6a21\u578b\u5c06\u4f1a\u662f\u5f00\u6e90TTS\u7684\u4e0d\u4e8c\u9009\u62e9\u3002<\/p>\n<p>&nbsp;<\/p>\n<h3>\u4f7f\u7528\u6d41\u7a0b<\/h3>\n<ol>\n<li><strong>\u8bed\u97f3\u751f\u6210<\/strong>\uff1a\n<ul>\n<li>\u51c6\u5907\u8f93\u5165\u6587\u672c\u6587\u4ef6\uff08\u4f8b\u5982\uff1ainput.txt\uff09\uff0c\u6bcf\u884c\u4e00\u4e2a\u53e5\u5b50\u3002<\/li>\n<li>\u8fd0\u884c\u4ee5\u4e0b\u547d\u4ee4\u8fdb\u884c\u8bed\u97f3\u751f\u6210\uff1a\n<pre><code>python generate.py --input input.txt --output output\/\r\n<\/code><\/pre>\n<\/li>\n<li>\u751f\u6210\u7684\u8bed\u97f3\u6587\u4ef6\u5c06\u4fdd\u5b58\u5728<code>output\/<\/code>\u76ee\u5f55\u4e0b\u3002<\/li>\n<\/ul>\n<\/li>\n<li><strong>\u8bed\u97f3\u514b\u9686<\/strong>\uff1a\n<ul>\n<li>\u51c6\u5907\u76ee\u6807\u8bf4\u8bdd\u4eba\u7684\u8bed\u97f3\u6837\u672c\u6587\u4ef6\uff08\u4f8b\u5982\uff1asample.wav\uff09\u3002<\/li>\n<li>\u8fd0\u884c\u4ee5\u4e0b\u547d\u4ee4\u8fdb\u884c\u8bed\u97f3\u514b\u9686\uff1a\n<pre><code>python clone.py --sample sample.wav --text input.txt --output output\/\r\n<\/code><\/pre>\n<\/li>\n<li>\u514b\u9686\u7684\u8bed\u97f3\u6587\u4ef6\u5c06\u4fdd\u5b58\u5728<code>output\/<\/code>\u76ee\u5f55\u4e0b\u3002<\/li>\n<\/ul>\n<\/li>\n<li><strong>\u60c5\u611f\u63a7\u5236<\/strong>\uff1a\n<ul>\n<li>\u5728\u751f\u6210\u8bed\u97f3\u65f6\uff0c\u53ef\u4ee5\u901a\u8fc7\u547d\u4ee4\u884c\u53c2\u6570\u8c03\u8282\u60c5\u611f\uff1a\n<pre><code>python generate.py --input input.txt --output output\/ --emotion happy\r\n<\/code><\/pre>\n<\/li>\n<li>\u652f\u6301\u7684\u60c5\u611f\u5305\u62ec\uff1ahappy, sad, angry, neutral\u3002<\/li>\n<\/ul>\n<\/li>\n<li><strong>\u7ca4\u8bed\u5408\u6210<\/strong>\uff1a\n<ul>\n<li>\u51c6\u5907\u7ca4\u8bed\u6587\u672c\u6587\u4ef6\uff08\u4f8b\u5982\uff1acantonese_input.txt\uff09\u3002<\/li>\n<li>\u8fd0\u884c\u4ee5\u4e0b\u547d\u4ee4\u8fdb\u884c\u7ca4\u8bed\u8bed\u97f3\u751f\u6210\uff1a\n<pre><code>python generate.py --input cantonese_input.txt --output output\/ --language cantonese\r\n<\/code><\/pre>\n<\/li>\n<li>\u751f\u6210\u7684\u7ca4\u8bed\u8bed\u97f3\u6587\u4ef6\u5c06\u4fdd\u5b58\u5728<code>output\/<\/code>\u76ee\u5f55\u4e0b\u3002<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<h3>\u8be6\u7ec6\u64cd\u4f5c\u6d41\u7a0b<\/h3>\n<ol>\n<li><strong>\u6587\u672c\u51c6\u5907<\/strong>\uff1a\n<ul>\n<li>\u786e\u4fdd\u8f93\u5165\u6587\u672c\u6587\u4ef6\u683c\u5f0f\u6b63\u786e\uff0c\u6bcf\u884c\u4e00\u4e2a\u53e5\u5b50\u3002<\/li>\n<li>\u6587\u672c\u5185\u5bb9\u5e94\u5c3d\u91cf\u7b80\u6d01\u660e\u4e86\uff0c\u907f\u514d\u590d\u6742\u7684\u53e5\u5f0f\u3002<\/li>\n<\/ul>\n<\/li>\n<li><strong>\u8bed\u97f3\u6837\u672c\u51c6\u5907<\/strong>\uff1a\n<ul>\n<li>\u8bed\u97f3\u6837\u672c\u5e94\u4e3a\u6e05\u6670\u7684\u5355\u4eba\u8bed\u97f3\uff0c\u80cc\u666f\u566a\u97f3\u5c3d\u91cf\u5c11\u3002<\/li>\n<li>\u6837\u672c\u957f\u5ea6\u5efa\u8bae\u57281\u5206\u949f\u4ee5\u5185\uff0c\u4ee5\u786e\u4fdd\u514b\u9686\u6548\u679c\u6700\u4f73\u3002<\/li>\n<\/ul>\n<\/li>\n<li><strong>\u53c2\u6570\u8c03\u8282<\/strong>\uff1a\n<ul>\n<li>\u6839\u636e\u9700\u8981\u8c03\u8282\u751f\u6210\u8bed\u97f3\u7684\u53c2\u6570\uff0c\u5982\u60c5\u611f\u3001\u8bed\u8a00\u7b49\u3002<\/li>\n<li>\u53ef\u4ee5\u901a\u8fc7\u4fee\u6539\u914d\u7f6e\u6587\u4ef6\u6216\u547d\u4ee4\u884c\u53c2\u6570\u5b9e\u73b0\u4e2a\u6027\u5316\u8bbe\u7f6e\u3002<\/li>\n<\/ul>\n<\/li>\n<li><strong>\u7ed3\u679c\u9a8c\u8bc1<\/strong>\uff1a\n<ul>\n<li>\u751f\u6210\u7684\u8bed\u97f3\u6587\u4ef6\u53ef\u4ee5\u901a\u8fc7\u97f3\u9891\u64ad\u653e\u5668\u8fdb\u884c\u8bd5\u542c\u3002<\/li>\n<li>\u5982\u679c\u6548\u679c\u4e0d\u7406\u60f3\uff0c\u53ef\u4ee5\u8c03\u6574\u8f93\u5165\u6587\u672c\u6216\u8bed\u97f3\u6837\u672c\uff0c\u91cd\u65b0\u751f\u6210\u3002<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>CosyVoice\u662f\u4e00\u4e2a\u591a\u8bed\u8a00\u5927\u89c4\u6a21\u8bed\u97f3\u751f\u6210\u6a21\u578b\uff0c\u63d0\u4f9b\u4ece\u63a8\u7406\u3001\u8bad\u7ec3\u5230\u90e8\u7f72\u7684\u5168\u6808\u80fd\u529b\u3002\u8be5\u9879\u76ee\u7531FunAudioLLM\u56e2\u961f\u5f00\u53d1\uff0c\u65e8\u5728\u901a\u8fc7\u5148\u8fdb\u7684\u81ea\u56de\u5f52\u53d8\u6362\u5668\u548c\u57fa\u4e8eODE\u7684\u6269\u6563\u6a21\u578b\uff0c\u5b9e\u73b0\u9ad8\u8d28\u91cf\u7684\u8bed\u97f3\u5408\u6210\u3002CosyVoice\u4e0d\u4ec5\u652f\u6301\u591a\u8bed\u8a00\u8bed\u97f3\u751f\u6210\uff0c\u8fd8&#8230;<\/p>\n","protected":false},"author":1,"featured_media":61140,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[230,237],"class_list":["post-7622","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-tool","tag-aikaiyuanxiangmu","tag-aiyuyinkelong"],"_links":{"self":[{"href":"https:\/\/www.kdjingpai.com\/de\/wp-json\/wp\/v2\/posts\/7622","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.kdjingpai.com\/de\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.kdjingpai.com\/de\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.kdjingpai.com\/de\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.kdjingpai.com\/de\/wp-json\/wp\/v2\/comments?post=7622"}],"version-history":[{"count":0,"href":"https:\/\/www.kdjingpai.com\/de\/wp-json\/wp\/v2\/posts\/7622\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.kdjingpai.com\/de\/wp-json\/wp\/v2\/media\/61140"}],"wp:attachment":[{"href":"https:\/\/www.kdjingpai.com\/de\/wp-json\/wp\/v2\/media?parent=7622"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.kdjingpai.com\/de\/wp-json\/wp\/v2\/categories?post=7622"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.kdjingpai.com\/de\/wp-json\/wp\/v2\/tags?post=7622"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}