[![ไธญๆ–‡](https://img.shields.io/badge/Chinese-ไธญๆ–‡-blue)](./README.md) [![Downloads](https://img.shields.io/github/downloads/Chenyme/Chenyme-AAVT/total.svg?style=flat-square&label=Downloads)](https://github.com/Chenyme/Chenyme-AAVT/releases) [![Telegram Group](https://img.shields.io/badge/Telegram-Group-blue?logo=telegram)](https://t.me/+j8SNSwhS7xk1NTc9) [![Latest Release](https://img.shields.io/github/v/release/Chenyme/Chenyme-AAVT.svg?style=flat-square)](https://github.com/Chenyme/Chenyme-AAVT/releases) ![PyPI - Version](https://img.shields.io/pypi/v/AAVT?logo=pypi) [![License](https://img.shields.io/github/license/Chenyme/Chenyme-AAVT.svg?style=flat-square)](https://github.com/Chenyme/Chenyme-AAVT/blob/main/LICENSE) [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/Kirie233/Chenyme-AAVT/blob/main/AAVT.ipynb)
--- > [!NOTE] > ๐ŸŒŸ **If this project helps you, please remember to give it a Star ๐ŸŒŸ for support!** > > ๐Ÿ“ **It is recommended to use the **Large** model for better experience.** > > ๐Ÿ“– [**Installation Guide**](https://blog.chenyme.top/blog/aavt-install) | โ“ [**FAQ**](https://blog.chenyme.top/blog/aavt-qa) | ๐Ÿ’ฌ [**Telegram Group**](https://t.me/+j8SNSwhS7xk1NTc9)
## Project Introduction **Chenyme-AAVT Automatic Video Translation Project** aims to provide a simple, efficient, and free automation process for media recognition and translation, helping you quickly complete tasks such as audio and video subtitle recognition, translation, and processing. Currently, the project not only helps you recognize and translate sound but also automates the generation of marketing blog content, and even performs separate subtitle translations. Future plans include adding more interesting tools based on existing basic functions, such as real-time recognition, lip-sync correction, voice cloning, timbre differentiation, and more. Stay tuned! Basic supported features, not all features: - ใ€[Audio Recognition](?tab=readme-ov-file#audio-recognition)ใ€‘|ใ€[Video Recognition](?tab=readme-ov-file#video-recognition)ใ€‘|ใ€[Blog Generation](?tab=readme-ov-file#blog-generation)ใ€‘|ใ€[Subtitle Translation](?tab=readme-ov-file#subtitle-translation)ใ€‘|ใ€[Voice Simulation](?tab=readme-ov-file#voice-simulation)ใ€‘
![20240820210851.jpg](https://github.com/Chenyme/Chenyme-AAVT/blob/main/cache/public/picture/home.jpg)
## Project Highlights
๐Ÿ“ƒ TODO | Tasks
### Recognition - [x] Replaced with faster Whisper project - [x] Supports local model loading - [x] Supports personal fine-tuning of Whisper models - [x] VAD-assisted optimization - [x] Word-level sentence segmentation optimization - [x] More language recognition ### Translation - [x] Translation optimization - [x] More language translations - [x] More translation models - [x] More translation engines - [x] Supports local large language model translation ### Subtitles - [x] Personalized subtitles - [x] More subtitle formats - [x] Subtitle preview, real-time editing - [ ] Automated subtitle text proofreading - [ ] Dual subtitles ### Other - [x] AI Assistant - [x] Video preview - [x] Blog generation from videos - [ ] Real-time voice translation - [ ] Chinese voiceover for videos - [ ] Timbre differentiation - [ ] Voice cloning - [ ] Lip-sync correction
- Supports recognition and translation of **multiple languages** - Supports **localized, free deployment** of the entire process - Supports **one-click generation of blog content, marketing blog** from videos - Supports **automated translation**, **secondary subtitle editing**, **video preview** - Supports **GPU acceleration**, **VAD assistance**, **FFmpeg acceleration** - Supports using various large models like **ChatGPT**, **Claude**, **Gemini**, **DeepSeek** for translation engines
## Windows Deployment
๐Ÿ‘‰ Prerequisites: Python, FFmpeg, CUDA Instructions
### Python | ๐Ÿ“– [Guide](https://blog.chenyme.top/blog/aavt-install#47a521d01156) - ๐Ÿ’ก Choose Python version > 3.8 - Go to the official Python website to download the installer - Run the installation and make sure to check the ADD TO PATH option ### FFMpeg | ๐Ÿ“– [Guide](https://blog.chenyme.top/blog/aavt-install#1897915fc461) - ๐Ÿ’ก If you are unsure how to install and compile, directly download the `Win` version from the projectโ€™s Release page, which comes with a pre-compiled FFmpeg - Go to the official FFmpeg website to download the compiled Windows version - Set FFmpeg as an environment variable ### CUDA (Skip for CPU) | ๐Ÿ“– [Guide](https://blog.chenyme.top/blog/aavt-install#1faea2d7295f) - ๐Ÿ’ก Recommended versions are CUDA 11.8, 12.1, 12.4 - Go to the CUDA website to download the installer - Install CUDA ##  

> โ€ผ๏ธ Make sure the prerequisites are ready before proceeding to the following stepsโ€ผ๏ธ > ### 1. Run Deployment Script > - Go to the Release page to download the latest `Win` version (Win/Small) > - Run `1_Install.bat` and wait for the script to check > - After passing, follow the prompts to choose the version for installation > ### 2. Run the Project Web > - Run `2_WebUI.bat` > - Enter `chenymeaavt` to access the project (this is a protection feature of the new version, can be turned off) > >   > > โ„น๏ธ **The WebUI will automatically launch, if it doesnโ€™t, manually enter `localhost:8501` in your browser**
## Mac OS Deployment
๐Ÿ‘‰ Prerequisites: Python, Brew Instructions
### Python - ๐Ÿ’ก Choose Python version > 3.8 - Go to the Python website to download the PGK installer - Run the installation and select the standard install on the page ### Brew - ๐Ÿ’ก Use the following command for one-click installation of `brew` ``` /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)" ``` ##  

> โ€ผ๏ธ Make sure the prerequisites are ready before proceeding to the following stepsโ€ผ๏ธ > ### 1. Install FFmpeg > ``` > brew install FFMpeg > ``` > ### 2. Install Project Dependencies > - Go to the Release page to download the latest `Mac` version (Mac/Small) > - cd to the project root directory > ``` > pip3 install -r requirements.txt > ``` > ### 3. Run the Project Web > ``` > streamlit run Chenyme-AAVT.py > ``` > - Enter `chenymeaavt` to access the project (this is a protection feature of the new version, can be turned off) > >   > > โ„น๏ธ **The WebUI will automatically launch, if it doesnโ€™t, manually enter `localhost:8501` in your browser**
## Docker Deployment > ๐Ÿ’ก Currently, the latest project version is V0.9.0. The Docker method is for version V0.8.x. > > Thanks to @Eisaichen for providing this version For detailed usage, please refer to: ๐Ÿ“– [eisai/chenyme-aavt](https://hub.docker.com/r/eisai/chenyme-aavt) ```shell docker pull eisai/chenyme-aavt ```
## Other Deployment Methods ### Google Colab Deployment > Thanks to @Kirie233 for providing this version For detailed usage instructions, please refer to: [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/Kirie233/Chenyme-AAVT/blob/main/AAVT.ipynb)
### Docker Deployment > ๐Ÿ’ก The current latest project version is V0.9.0. This Docker method is for version V0.8.x, > > Thanks to @Eisaichen for providing this version ```commandline docker pull eisai/chenyme-aavt ``` For detailed usage instructions, please refer to: ๐Ÿ“– [eisai/chenyme-aavt](https://hub.docker.com/r/eisai/chenyme-aavt)
### Linux Deployment > As my computer is currently left at school, I havenโ€™t studied this yet. However, I believe solving FFmpeg and CUDA should work fine.

## Star History [![Star History Chart](https://api.star-history.com/svg?repos=Chenyme/Chenyme-AAVT&type=Timeline)](https://star-history.com/#Chenyme/Chenyme-AAVT&Timeline)

### Homepage BOT
![11](https://github.com/Chenyme/Chenyme-AAVT/blob/main/cache/public/picture/bot.png)
### Some Settings
![12](https://github.com/Chenyme/Chenyme-AAVT/blob/main/cache/public/picture/setting.png)
### Audio Recognition
![13](https://github.com/Chenyme/Chenyme-AAVT/blob/main/cache/public/picture/audio.png)
### Video Recognition
![14](https://github.com/Chenyme/Chenyme-AAVT/blob/main/cache/public/picture/video.png)
### Blog Generation
![15](https://github.com/Chenyme/Chenyme-AAVT/blob/main/cache/public/picture/blog.png)
### Subtitle Translation
![16](https://github.com/Chenyme/Chenyme-AAVT/blob/main/cache/public/picture/srt.png)
### Voice Simulation
![17](https://github.com/Chenyme/Chenyme-AAVT/blob/main/cache/public/picture/opentts.png)
## Acknowledgements I have greatly benefited from the AI era, and this project has largely been realized by standing on the shoulders of giants. Thanks to the open-source spirit, and thanks to the developers of OpenAI, Streamlit, FFmpeg, Faster-whisper, and more!