Stereo images are captured using cameras connected to PC. These images are segmented and stored in the compressed form. The compressed segmented images are transmitted to the ARM-9 processor based system by using ZIGBEE wireless module. These images, when received at the receiver end, are recovered, and a 3-D image is generated on the TFT display. Depth levels are also estimated from segmented stereo images and transmitted through ZIGBEE module to ARM-9 processor based hardware. Depth levels received at the hardware are used to control a robot. This proposal is a prototype that can be implemented for vision based industrial applications. The present paper deals with the transmitter-receiver link for stereo images and movement of robot proportional to estimated depth levels.