We are trying to to build a product that recognizes text from a live environment video streams and process the same into speech. A preliminary understanding of image processing and Text-to-Speech engines helps build software for the same. However, we are trying to reduce latency to a margin of ~0.25 - 0.5 s.
Sir, I have expertise in the field of Image processing, speech recognition and computer vision and I can provide you your complete task in decided time frame with quality work. Share your details with me in the message box
Regards