A computational model has been proposed which is capable of simulating early phases of speech acquisition, speech production, and speech perception. The model comprises two main modules, i.e. mental lexicon and action repository. The mental lexicon activates semantic and phonological representations of words (cognitive level) while the action repository activates sensory and motor representations of syllables. The model has been implemented and tested by simulating early phases of speech acquisition (i.e. babbling phase and imitation phase) and performing production and perception tests after learning.