3 years ago · 3e33303552
--- a/README.md
+++ b/README.md
@@ -1,33 +1,36 @@
 # video-subtitle-extract
 
-无字幕文件的视频，字幕通过opencv图片识别方法抽取其中的文字。
+Extracts subtitles from videos that have no subtitle file by recognizing the text in the frames with OpenCV-based image recognition. The project includes Python (video processing) and Java (image-to-text) source.
 
-## 引言
+## Introduction
 
-本工具主要为新闻媒体，自媒体等公司服务。实现了一下功能：
+This tool mainly serves news media, self-media, and similar companies. It implements the following functions:
 
-对视频中的字幕进行定位，
+Locate the subtitles in the video.
 
-对视频中字幕通过 **自然语言识别** 功能**提取其中的字幕文字**。
+Extract the subtitle text from the video using the **Natural Language Recognition** function.
 
 ## Usage
 
-1. 把视频文件放到 videos 文件夹中。
+1. Put the video file in the videos folder.
 
-2. 先执行 pre_do.py 文件，查看一下视频的帧速，视频尺寸等信息。
+2. First run pre_do.py to check the video's frame rate, frame size, and other information.
 
     ![](assets/BaiduHi_2020-2-18_11-14-28.png)
 
-3. 通过上一步操作，修改 main.py 中的 参数，使得字幕图片被很好的截取。然后执行 main.py 会在 videos 文件夹中生成对视频分析后截取的字幕图片（10句合成一个图片，便于识别）。
+3. Based on the output of the previous step, adjust the parameters in main.py so that the subtitle region is cropped cleanly. Then run main.py, which analyzes the video and writes the cropped subtitle images to the videos folder (10 subtitle lines are combined into one image to make recognition easier).
 
     ![](assets/BaiduHi_2020-2-18_11-16-10.png)
 
-4. 执行 MainFm.java 对上述图片进行批量文字识别（分别调用百度/搜狗等文字识别免费接口）。获得结果保存到 项目resxxx.txt 文件中。
+4. Run MainFm.java to perform batch text recognition on those images (it calls the free text recognition interfaces of Baidu, Sogou, and others). The results are saved to the project's resxxx.txt file.
 
     ![](assets/BaiduHi_2020-2-18_11-13-33.png)
 
 
-## 注意
+## Note
 
-由于文字识别接口，可能识别错误。所以可以多执行几次，比较结果。
+The text recognition interfaces may return incorrect results, so you can run the recognition several times and compare the results.
 
+## Copyright
+
+mail: liuyuqi.gov@msn.cn
--- a/README_CN.md
+++ b/README_CN.md
@@ -0,0 +1,37 @@
+# video-subtitle-extract
+
+无字幕文件的视频，字幕通过 opencv 图片识别方法抽取其中的文字，该项目包含python（视频处理），java（图片转文字）。
+
+## 引言
+
+本工具主要为新闻媒体，自媒体等公司服务。实现了一下功能：
+
+对视频中的字幕进行定位，
+
+对视频中字幕通过 **自然语言识别** 功能**提取其中的字幕文字**。
+
+## Usage
+
+1. 把视频文件放到 videos 文件夹中。
+
+2. 先执行 pre_do.py 文件，查看一下视频的帧速，视频尺寸等信息。
+
+    ![](assets/BaiduHi_2020-2-18_11-14-28.png)
+
+3. 通过上一步操作，修改 main.py 中的 参数，使得字幕图片被很好的截取。然后执行 main.py 会在 videos 文件夹中生成对视频分析后截取的字幕图片（10句合成一个图片，便于识别）。
+
+    ![](assets/BaiduHi_2020-2-18_11-16-10.png)
+
+4. 执行 MainFm.java 对上述图片进行批量文字识别（分别调用百度/搜狗等文字识别免费接口）。获得结果保存到 项目resxxx.txt 文件中。
+
+    ![](assets/BaiduHi_2020-2-18_11-13-33.png)
+
+
+## 注意
+
+由于文字识别接口，可能识别错误。所以可以多执行几次，比较结果。
+
+## 版权
+
+mail: liuyuqi.gov@msn.cn
+
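Step 3 of the Usage section describes cropping the subtitle region out of each frame and combining 10 subtitle lines into one image. The diff does not show how main.py does this, so the following is only a minimal sketch of the idea under stated assumptions: the names `crop_subtitle_band` and `merge_bands` are hypothetical, a frame is modelled as a plain list of pixel rows rather than an OpenCV array, and the crop bounds are values you would pick yourself after inspecting the frame size.

```python
from typing import List, Sequence

# Hypothetical stand-in for a grayscale frame: a list of rows of pixel values.
Frame = List[List[int]]


def crop_subtitle_band(frame: Frame, top: int, bottom: int) -> Frame:
    """Return a copy of the rows from `top` (inclusive) to `bottom` (exclusive)."""
    if not 0 <= top < bottom <= len(frame):
        raise ValueError("crop bounds must lie inside the frame")
    return [row[:] for row in frame[top:bottom]]


def merge_bands(bands: Sequence[Frame], per_image: int = 10) -> List[Frame]:
    """Stack cropped bands vertically, `per_image` bands per output image."""
    merged: List[Frame] = []
    for start in range(0, len(bands), per_image):
        image: Frame = []
        for band in bands[start:start + per_image]:
            image.extend(band)  # append this band's rows below the previous ones
        merged.append(image)
    return merged


if __name__ == "__main__":
    # 25 dummy frames, each 4 rows by 3 columns; the subtitle band is rows 2..3.
    frames = [[[i] * 3 for _ in range(4)] for i in range(25)]
    bands = [crop_subtitle_band(f, top=2, bottom=4) for f in frames]
    images = merge_bands(bands)
    print(len(images), [len(img) for img in images])
```

Batching the crops keeps the number of recognition requests low, which is presumably why the README combines 10 lines per image; the last image simply holds whatever bands remain.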