Lip reading is speech recognition based on visual information such as the shape of
the lips and their movement. The visual features are extracted frame by frame, at the
frame rate of the video sequence. The work proposed in this paper uses the lower part
of the human face to extract speech-relevant features accurately and robustly from the
inner edge of the lips. These features are then traced against the acoustic signal to
show that they are distinctive and that they can be merged with sound features, by
measuring the curves of their physiological or behavioral characteristics. The results
were promising, with a good response of 94% to 100%.
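
As a rough illustration of per-frame feature extraction from the inner lip edge, the sketch below computes one feature vector per video frame and stamps it with a time derived from the frame rate. It is only a minimal sketch under stated assumptions: the choice of features (mouth-opening width, height, and enclosed area) and the names `inner_lip_features` and `feature_track` are hypothetical and are not taken from the paper, and the inner-lip contour is assumed to be already available as a list of (x, y) points per frame.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def inner_lip_features(contour: Sequence[Point]) -> Tuple[float, float, float]:
    """Return (width, height, area) of one inner-lip contour.

    Assumption: these three geometric features are illustrative only;
    the paper does not specify which features it uses.
    """
    xs = [p[0] for p in contour]
    ys = [p[1] for p in contour]
    width = max(xs) - min(xs)
    height = max(ys) - min(ys)

    # Shoelace formula for the area enclosed by the closed contour.
    twice_area = 0.0
    n = len(contour)
    for i in range(n):
        x1, y1 = contour[i]
        x2, y2 = contour[(i + 1) % n]
        twice_area += x1 * y2 - x2 * y1
    return width, height, abs(twice_area) / 2.0


def feature_track(
    contours: Sequence[Sequence[Point]], fps: float
) -> List[Tuple[float, float, float, float]]:
    """Return one (time_s, width, height, area) row per video frame."""
    return [(i / fps, *inner_lip_features(c)) for i, c in enumerate(contours)]


if __name__ == "__main__":
    # Two synthetic frames: a closed mouth opening, then a wider one.
    frames = [
        [(0, 0), (4, 0), (4, 1), (0, 1)],
        [(0, 0), (4, 0), (4, 2), (0, 2)],
    ]
    for row in feature_track(frames, fps=25.0):
        print(row)
```

Because each row carries a timestamp, the resulting curves can be aligned with an audio feature track sampled on the same time axis, which is one plausible way to compare or merge the two kinds of features.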