YOSHIOKA, Masaharu

Principal Investigator, Professor

Hokkaido University

Related Website

Web site

Contact

yoshioka atmark ist.hokudai.ac.jp

YOSHIOKA, Masaharu Group

Principal Investigator

YOSHIOKA, Masaharu

Faculty Members

NATH, Pinku

Postdoctoral Fellows

RAGHAVAN, Sriram Srinivasa

About the Research

Research Theme

Information/knowledge extraction from research papers and its utilization

Keyword

Knowledge management, Ontology, Information Retrieval, Natural Language Processing, Design theory

Research Outline

I aim to extract information about the chemical reaction network graphs contained in collections of scientific papers and use it to analyze chemical reactions. Developing such tools and providing them to chemists will serve as a starting point for a growing framework of cooperation between data scientists and chemists. For chemists, this can yield valuable insights into the global chemical research landscape; for data scientists, it offers an inroad to understanding the research design process and the needs of specialists.

Natural language processing is the traditional tool for extracting this kind of information, but it is difficult to recruit experts for the necessary annotation step, which provides the labels computers use to understand which parts of a text signify which concepts. Nevertheless, using the above-mentioned tools, I hope that close collaboration with chemists at ICReDD will give me an opportunity to establish a framework of mutual benefit. In addition, recently developed neural network-based tools promise to reduce the annotation burden by processing the large amounts of data available in the ever-growing literature and extracting characteristic chemical information automatically.

The Researcher’s Perspective

My PhD supervisor was Professor Hiroyuki Yoshikawa, an eminent figure in the field of General Design Theory. He always emphasized that scientific findings should benefit humankind, and so he coined the term "modern evil." Modern evils are particular design solutions that cause greater problems elsewhere, such as cheap, labor-saving plastic straws, which are serious pollutants of the environment. I would like to contribute to overcoming such modern evils by understanding the design process itself and then extending that understanding to the wider context.

Representative Research Achievements

Construction of an In-House Paper/Figure Database System Using Portable Document Format Files
Masaharu Yoshioka and Shinjiro Hara, in Information Search, Integration, and Personalization: 10th International Workshop, ISIP 2018, Dimitris Kotzinos, Dominique Laurent, Nicolas Spyratos, Yuzuru Tanaka, and Rin-ichiro Taniguchi (eds), Springer-Verlag GmbH, CCIS 1040, 2019, 142-156
Extraction of Chemical and Drug Named Entities by Ensemble Learning Using Chemical NER Tools Based on Different Extraction Guidelines
Thaer M. Dieb and Masaharu Yoshioka, Transactions on Machine Learning and Data Mining, 2015, Vol. 8, No. 2, 61-76
Framework for Automatic Information Extraction from Research Papers on Nanocrystal Devices
Thaer M. Dieb, Masaharu Yoshioka, Shinjiroh Hara, and Marcus C. Newton, Beilstein Journal of Nanotechnology, 2015, Vol. 6, 1872-1882
DOI: 10.3762/bjnano.6.190
On a Combination of Probabilistic and Boolean IR Models for WWW Document Retrieval
Masaharu Yoshioka and Makoto Haraguchi, ACM Transactions on Asian Language Information Processing (TALIP), 2005, Vol. 4, No. 3, 340-356
DOI: 10.1145/1111667.1111674
Physical Concept Ontology for the Knowledge Intensive Engineering Framework
Masaharu Yoshioka, Yasushi Umeda, Hideaki Takeda, Yoshiki Shimomura, Yutaka Nomaguchi, and Tetsuo Tomiyama, Advanced Engineering Informatics, 2004, Vol. 18, No. 2, 95-113
DOI: 10.1016/j.aei.2004.09.004

Related Research

Development of machine learning framework for automatically extracting detailed synthesis procedures from organic chemistry articles

Publications

2024

A Framework for Reviewing the Results of Automated Conversion of Structured Organic Synthesis Procedures from the Literature
K. Machi, S. Akiyama, Y. Nagata, M. Yoshioka, Digital Discovery, 2024
DOI: 10.1039/d4dd00335g

2023

OSPAR: A Corpus for Extraction of Organic Synthesis Procedures with Argument Roles
K. Machi, S. Akiyama, Y. Nagata, M. Yoshioka, J. Chem. Inf. Model., 2023, 63, 21, 6619-6628
DOI: 10.1021/acs.jcim.3c01449

2022

Data-Augmentation Method for BERT-Based Legal Textual Entailment Systems in COLIEE Statute Law Task
Y. Aoki, M. Yoshioka, Y. Suzuki, Review of Socionetwork Strategies, 2022, 16, 175-196
DOI: 10.1007/s12626-022-00104-0