This project integrates real-time object recognition with text-to-speech capabilities, providing an interactive experience that announces detected objects to the user. It uses the YOLO (You Only Look ...
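The "announce detected objects" step can be sketched as below. This is a minimal illustration, not the project's actual code: the `Announcer` class, the `(label, confidence)` detection shape, the confidence threshold, and the cooldown are all assumptions introduced here. The detector itself is left out, and `speak` defaults to `print` so the sketch runs without a text-to-speech engine.

```python
import time
from typing import Callable, Dict, Iterable, List, Tuple

# Hypothetical detection shape: (label, confidence in [0, 1]).
Detection = Tuple[str, float]


class Announcer:
    """Speaks detected labels, skipping weak or recently announced ones."""

    def __init__(
        self,
        speak: Callable[[str], None] = print,  # stand-in for a real TTS call
        min_confidence: float = 0.5,           # assumed threshold
        cooldown_s: float = 3.0,               # assumed repeat delay per label
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.speak = speak
        self.min_confidence = min_confidence
        self.cooldown_s = cooldown_s
        self.clock = clock
        self._last_spoken: Dict[str, float] = {}

    def process(self, detections: Iterable[Detection]) -> List[str]:
        """Announce eligible labels from one frame; return those spoken."""
        now = self.clock()
        announced: List[str] = []
        for label, confidence in detections:
            if confidence < self.min_confidence:
                continue  # too uncertain to announce
            last = self._last_spoken.get(label)
            if last is not None and now - last < self.cooldown_s:
                continue  # announced recently; avoid repeating every frame
            self._last_spoken[label] = now
            self.speak(f"I see a {label}")
            announced.append(label)
        return announced
```

In a real-time loop, `process` would be called once per frame with that frame's detections, and a text-to-speech function would be passed as `speak`. The per-label cooldown is one possible design choice: without it, an object that stays in view would be announced on every frame.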