豊橋技術科学大学

Murakoshi, Kazushi

Affiliation Department of Computer Science and Engineering
Title Associate Professor
Fields of Research Computational Intelligence / Neural Information Science
Degree Dr. of Engineering (Chiba University)
Academic Societies Institute of Electronics, Information and Communication Engineers (IEICE) / Japanese Neural Network Society (JNNS) / Japanese Cognitive Science Society (JCSS) / Vision Society of Japan / Japan Neuroscience Society / Society for Neuroscience
E-mail mura@
Please append "tut.jp" to the end of the address above.
Laboratory website URL http://www.ci.cs.tut.ac.jp/
Researcher information URL(researchmap) Researcher information

Research

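Humans and animals possess remarkable information-processing capabilities, yet many of these capabilities are still not understood. We therefore aim to elucidate the information-processing mechanisms behind them and, when artificial information processing runs into a wall or a hard problem, to learn from the methods of biological information processing in search of a way forward. Examining the information-processing mechanisms of humans and animals requires drawing on a wide range of academic fields. For that reason, we carefully examine findings from physiology and psychology and pursue our research from multiple viewpoints while taking an information-science approach. The ultimate goal is to build artifacts whose capabilities equal or exceed those of humans and animals.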

Theme 1: Neural Circuit Model Forming a Semantic Network

Overview

Humans are said to store knowledge as a semantic network, that is, a network of linked pieces of knowledge. We propose a neural circuit model that explains how a human being learns such a semantic network. The outline of the model is shown in the figure. The association cortex 1 layer passes its inputs to the dentate gyrus, CA1, and association cortex 2 layers, and the CA1 layer produces the learned time-sequence patterns that arrive from the association cortex 1 layer via the dentate gyrus and CA3. Association area 2 consists of two layers, 2a and 2b: 2a stores episodic memory, that is, personal experience, and 2b stores semantic memory. Spike-timing-dependent synaptic plasticity (STDP) was used as the learning rule in association area 2b in order to examine the relation between STDP as observed in physiology and the formation of semantic memory. STDP is the phenomenon in which synaptic transmission efficiency changes according to the relative timing of the firings of connected neurons, and it is thought to be the origin of various kinds of learning in living organisms. The thick arrows in Figure 2 show the neural connections that are changed by STDP.
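To make the timing dependence of STDP concrete, here is a minimal sketch of a generic pair-based STDP rule with an exponential window. It is not the learning rule of the model described above: the function names, amplitudes (`a_plus`, `a_minus`), time constants (`tau_plus`, `tau_minus`), and weight bounds are illustrative assumptions.

```python
import math


def stdp_delta_w(dt_ms, a_plus=0.01, a_minus=0.012,
                 tau_plus=20.0, tau_minus=20.0):
    """Weight change for one pre/post spike pair (all constants are assumed).

    dt_ms = t_post - t_pre in milliseconds.
    dt_ms > 0: the presynaptic neuron fired first, so the synapse is strengthened.
    dt_ms < 0: the postsynaptic neuron fired first, so the synapse is weakened.
    """
    if dt_ms > 0:
        return a_plus * math.exp(-dt_ms / tau_plus)
    if dt_ms < 0:
        return -a_minus * math.exp(dt_ms / tau_minus)
    return 0.0


def apply_stdp(weight, pre_spikes, post_spikes, w_min=0.0, w_max=1.0):
    """Sum the change over all pre/post spike pairs, then clip the weight."""
    for t_pre in pre_spikes:
        for t_post in post_spikes:
            weight += stdp_delta_w(t_post - t_pre)
    return min(w_max, max(w_min, weight))


if __name__ == "__main__":
    # Presynaptic spikes lead postsynaptic spikes by 10 ms.
    print(apply_stdp(0.5, pre_spikes=[0.0, 100.0], post_spikes=[10.0, 110.0]))
    # Same spike trains with the roles swapped.
    print(apply_stdp(0.5, pre_spikes=[10.0, 110.0], post_spikes=[0.0, 100.0]))
```

In this sketch the sign of the change depends only on which neuron fired first, and its size decays as the two spikes move further apart in time.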

Keywords

human memory, STDP

Theme 2: Flexible Reinforcement Learning Algorithm

Overview

Humans learn by trial and error from the results (rewards) of their actions, and through such reinforcement learning they acquire the best action. Reinforcement learning algorithms of this kind have been proposed, but the conventional learning methods take a long time to re-learn when the environment changes. We have therefore proposed a flexible learning method that properly controls the parameters of reinforcement learning. In Figure 3, a new wall is placed on the learned path from the start at the upper left to the goal at the lower right. The conventional reinforcement learning algorithm could not easily find the detour, whereas our proposed technique found it quickly.
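The re-learning scenario can be sketched with tabular Q-learning on a small grid. The overview does not state the parameter control rule, so the `adaptive` option below is a hypothetical stand-in, not the proposed method: it raises the exploration rate in proportion to a running average of the absolute TD error, on the assumption that a sudden increase in error signals a changed environment. The grid size, rewards, and all constants are likewise assumptions.

```python
import random

SIZE = 5
START, GOAL = (0, 0), (SIZE - 1, SIZE - 1)    # upper left to lower right
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def new_q():
    """Q-table with one value per action for every cell, initialized to zero."""
    return {(r, c): [0.0] * 4 for r in range(SIZE) for c in range(SIZE)}


def step(state, action, walls):
    """Move one cell; hitting a wall or the border leaves the agent in place."""
    r, c = state[0] + action[0], state[1] + action[1]
    if not (0 <= r < SIZE and 0 <= c < SIZE) or (r, c) in walls:
        r, c = state
    return (r, c), -1.0, (r, c) == GOAL  # each step costs 1 until the goal


def train(q, walls, episodes, rng, adaptive=False,
          alpha=0.5, gamma=0.95, eps_min=0.05, eps_max=0.5):
    """Epsilon-greedy Q-learning; adaptive=True ties epsilon to recent |TD error|."""
    surprise = 0.0  # running average of |TD error| (hypothetical control signal)
    for _ in range(episodes):
        state = START
        for _ in range(200):  # step cap per episode
            eps = min(eps_max, eps_min + surprise) if adaptive else eps_min
            if rng.random() < eps:
                a = rng.randrange(4)
            else:
                a = max(range(4), key=lambda i: q[state][i])
            nxt, reward, done = step(state, ACTIONS[a], walls)
            target = reward + (0.0 if done else gamma * max(q[nxt]))
            td = target - q[state][a]
            q[state][a] += alpha * td
            surprise = 0.9 * surprise + 0.1 * abs(td)
            state = nxt
            if done:
                break
    return q


def greedy_path(q, walls, limit=50):
    """Follow the greedy policy from START and return the visited cells."""
    state, path = START, [START]
    for _ in range(limit):
        a = max(range(4), key=lambda i: q[state][i])
        state, _, done = step(state, ACTIONS[a], walls)
        path.append(state)
        if done:
            break
    return path


if __name__ == "__main__":
    rng = random.Random(0)
    q = train(new_q(), set(), 2000, rng)            # learn the open grid
    path = greedy_path(q, set())
    wall = {path[len(path) // 2]}                   # block a cell on the learned path
    q = train(q, wall, 2000, rng, adaptive=True)    # re-learn with the new wall
    print(path)
    print(greedy_path(q, wall))
```

Running `train` with `adaptive=False` after the wall is added gives the fixed-parameter baseline, so the two settings can be compared by how many episodes each needs before `greedy_path` reaches the goal again.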

Keywords

reinforcement learning, re-learning, flexible
