Here we investigate the unique advantages of our proposed Virtual Space Teleconferencing System (VST) in the area of multimedia teleconferencing, with emphasis on the transmission and recognition of facial emotions. Specifically, we show that with this concept the emotions of a local participant can be transmitted to the remote party with a higher recognition rate, by enhancing them through intelligent processing placed between the local and remote participants. This leads to a kind of emotion-enhanced teleconferencing system that can surpass face-to-face meetings by effectively lowering the barriers to recognizing emotions between people of different nations. We also introduce the concept of a virtual person, a better alternative to the blurred or mosaicked facial images seen in some television interviews with people who are unwilling to be exposed in public. Finally, we compare the data rate required by the proposed method with that of two other available methods, and confirm that our approach requires a much lower data rate than either of them.