12 AI Hands-on Practice


  1. Machine Learning for Image Recognition
  2. Machine Learning in Natural Language Processing

1. Machine Learning for Image Recognition

This webpage embeds the code listed in Chapter 12 of 『超入門 はじめてのAI・データサイエンス』 and corresponds to Session 12 of the English website. By running the code on Colab, which was introduced in Session 04 (Chapter 4 of the book), you can try out MNIST, a dataset widely used as introductory material for image recognition.

12.1.1 Handwritten Digit Dataset "MNIST"

MNIST, the dataset used here, stands for the Modified National Institute of Standards and Technology database. It is a dataset of handwritten digits containing 60,000 training images and 10,000 test images, each a 28×28 grayscale image.

12.1.2 Loading the MNIST Dataset and Reshaping the Data


After selecting a GPU runtime as instructed on the English website, load the MNIST dataset with code-1 below.

from tensorflow.keras.datasets import mnist
(x_train, y_train),(x_test, y_test) = mnist.load_data()


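Use code-2 to preprocess the image data: each image is reshaped to 28×28×1 (height, width, one grayscale channel) and the pixel values are scaled from 0-255 to the range 0-1.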

x_train = x_train.reshape(-1, 28, 28, 1) / 255.0
x_test = x_test.reshape(-1, 28, 28, 1) / 255.0
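The effect of code-2 can be checked without downloading MNIST. The following is a minimal sketch that uses a small random array (`fake_images`, a hypothetical stand-in for `x_train`) to show what the reshape and the division by 255 do.

```python
import numpy as np

# Hypothetical stand-in for x_train: 4 grayscale images of 28x28 pixels,
# stored as unsigned 8-bit integers (0-255), as MNIST pixel values are.
rng = np.random.default_rng(0)
fake_images = rng.integers(0, 256, size=(4, 28, 28), dtype=np.uint8)

# Add a trailing channel axis (1 = grayscale) and scale pixels to [0, 1].
# -1 lets NumPy infer the number of images from the array size.
scaled = fake_images.reshape(-1, 28, 28, 1) / 255.0

print(scaled.shape)  # (4, 28, 28, 1)
print(scaled.dtype)  # float64
print(scaled.min() >= 0.0, scaled.max() <= 1.0)  # True True
```

The trailing axis of size 1 is what the `input_shape=(28, 28, 1)` argument of the first convolution layer expects.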


Preprocess the label data with code-3 below, which converts each digit label (0-9) into a one-hot vector of length 10.

import tensorflow as tf
y_train = tf.keras.utils.to_categorical(y_train, num_classes=10)
y_test = tf.keras.utils.to_categorical(y_test, num_classes=10)
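To make the one-hot conversion concrete, here is a rough NumPy sketch of the same idea. The helper `one_hot` is a hypothetical name introduced for illustration; it is not how `to_categorical` is implemented, only a small function that produces the same kind of output.

```python
import numpy as np

def one_hot(labels, num_classes=10):
    """Return a (len(labels), num_classes) array with a single 1.0 per row."""
    labels = np.asarray(labels)
    encoded = np.zeros((labels.size, num_classes), dtype=np.float32)
    # Put a 1.0 in column labels[i] of row i
    encoded[np.arange(labels.size), labels] = 1.0
    return encoded

example = one_hot([5, 0, 9])
print(example[0])              # [0. 0. 0. 0. 0. 1. 0. 0. 0. 0.]
print(example.argmax(axis=1))  # [5 0 9]
```

Taking `argmax` along each row recovers the original digit, which is how the one-hot labels can be compared with predictions later.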

12.1.3 Building the Machine Learning Model


Use code-4 below to build a model.

# Import the necessary modules
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Dropout

# Create the Sequential container in which Keras layers are stacked
model = Sequential()

# Convolution layers: 32 filters of size 3x3, then 64 filters of size 3x3
model.add(Conv2D(32, kernel_size=(3, 3), activation="relu",
                 input_shape=(28, 28, 1)))
model.add(Conv2D(64, (3, 3), activation="relu"))

# Max pooling keeps the strongest response in each 2x2 window
model.add(MaxPooling2D(pool_size=(2, 2)))

# Dropout randomly disables units during training to reduce overfitting
# (assumed rate of 0.25; the source does not state the value)
model.add(Dropout(0.25))

# Flatten the feature maps into a 1-D vector for the dense layers
model.add(Flatten())

# Fully connected layer applied to the flattened data
model.add(Dense(128, activation="relu"))

# Softmax output layer returns a probability for each of the 10 digits
model.add(Dense(10, activation="softmax"))
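The layer sizes in code-4 can be followed with simple arithmetic. The sketch below assumes the Keras defaults for `Conv2D` (no padding, stride 1) and that the flatten step described in the comments is applied before the first `Dense` layer. The helper names are hypothetical.

```python
def conv_out(size, kernel):
    """Output width/height of an unpadded convolution with stride 1."""
    return size - kernel + 1

def conv_params(kernel, in_channels, filters):
    """Kernel weights plus one bias per filter."""
    return kernel * kernel * in_channels * filters + filters

def dense_params(inputs, units):
    """One weight per input-unit pair plus one bias per unit."""
    return inputs * units + units

side = conv_out(28, 3)    # 26 after the first Conv2D
side = conv_out(side, 3)  # 24 after the second Conv2D
side = side // 2          # 12 after 2x2 max pooling
flat = side * side * 64   # 9216 values after flattening

total = (conv_params(3, 1, 32) + conv_params(3, 32, 64)
         + dense_params(flat, 128) + dense_params(128, 10))
print(side, flat, total)  # 12 9216 1199882
```

Pooling, dropout, and flattening have no trainable parameters, so nearly all of the weights sit in the 128-unit dense layer.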


Use code-5 to compile the model.

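# Set the optimizer, loss function, and evaluation metric
# (assumed values: Adam is illustrative; the loss matches the one-hot labels)
model.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"])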
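The comments for code-5 mention a loss function. Because code-3 one-hot encodes the labels and the model ends in a softmax layer, categorical cross-entropy is a common choice for this setup, though the source does not confirm which loss the book uses. The following pure-Python sketch shows how that loss behaves for a single example with three classes. The function name and the sample values are hypothetical.

```python
import math

def categorical_cross_entropy(y_true, y_pred, eps=1e-7):
    """Loss for one example: -sum(true_k * log(pred_k)) over the classes."""
    # eps guards against log(0) when a predicted probability is exactly zero
    return -sum(t * math.log(max(p, eps)) for t, p in zip(y_true, y_pred))

y_true = [0, 0, 1]             # one-hot label: the true class is index 2
confident = [0.05, 0.05, 0.90] # high probability on the true class
wrong = [0.70, 0.20, 0.10]     # low probability on the true class

print(round(categorical_cross_entropy(y_true, confident), 4))  # 0.1054
print(round(categorical_cross_entropy(y_true, wrong), 4))      # 2.3026
```

The loss is small when the model puts high probability on the correct class and grows quickly as that probability shrinks, which is the signal the optimizer uses to update the weights.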

12.1.4 Training the Model


With code-6, the model is trained on the data.

model.fit(x_train, y_train, batch_size=128, epochs=10,
          validation_data=(x_test, y_test))
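The `batch_size` and `epochs` arguments determine how many weight updates take place during training. As a small worked example, using the 60,000 images of the MNIST training set:

```python
import math

num_train = 60000  # number of images in the MNIST training set
batch_size = 128
epochs = 10

# Each epoch passes over the whole training set once, one batch at a time.
steps_per_epoch = math.ceil(num_train / batch_size)
last_batch = num_train - (steps_per_epoch - 1) * batch_size

print(steps_per_epoch)           # 469
print(last_batch)                # 96
print(steps_per_epoch * epochs)  # 4690
```

The step count shown in the Colab progress bar for each epoch should match `steps_per_epoch`; the final batch of each epoch is smaller because 60,000 is not a multiple of 128.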


Run code-7 to display sample test images together with the model's predictions.

import numpy as np
import matplotlib.pyplot as plt

# Select a random subset of images from the test set
num_images = 5
random_indices = np.random.randint(0, len(x_test), num_images)
images = x_test[random_indices]
labels = y_test[random_indices]

# Make predictions on the selected images
predictions = model.predict(images)
predicted_labels = np.argmax(predictions, axis=1)

# Display the images with their predicted labels and scores
fig, axes = plt.subplots(1, num_images, figsize=(12, 3))

for i in range(num_images):
    axes[i].imshow(images[i].reshape(28, 28), cmap="gray")
    axes[i].set_title(
        f"Predicted: {predicted_labels[i]}\nScore: {round(float(np.max(predictions[i])), 8)}"
    )
    axes[i].axis("off")

plt.show()
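Code-7 stores the one-hot `labels` of the selected images, so the predictions can also be compared with the true digits. The sketch below illustrates that comparison on small made-up arrays (3 images, 4 classes) rather than on real model output; all values are hypothetical.

```python
import numpy as np

# Hypothetical softmax outputs for 3 images and 4 classes (each row sums to 1)
predictions = np.array([
    [0.10, 0.70, 0.10, 0.10],
    [0.60, 0.20, 0.10, 0.10],
    [0.05, 0.05, 0.10, 0.80],
])
# Hypothetical one-hot labels: the true classes are 1, 2, and 3
labels = np.array([
    [0, 1, 0, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 1],
])

predicted = np.argmax(predictions, axis=1)  # most probable class per image
actual = np.argmax(labels, axis=1)          # index of the 1 in each label
accuracy = float(np.mean(predicted == actual))

print(predicted, actual)   # [1 0 3] [1 2 3]
print(round(accuracy, 2))  # 0.67
```

The same two `argmax` calls applied to `model.predict(x_test)` and `y_test` would give the accuracy over the whole test set.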


2. Machine Learning in Natural Language Processing


Natural Language Processing (NLP), currently one of the most widely used areas of AI, is explained on the English website.

12.2.9 Let's Try Sentiment Analysis


Install the required libraries with code-8.

# Install the necessary libraries
!pip install -q transformers
!pip install ipadic
!pip install fugashi
!pip install xformers


Running code-9 imports the tokenizer class, the model class, and the pipeline function from the Transformers library so that they are ready to be called.

from transformers import AutoTokenizer, BertForSequenceClassification, pipeline


Code-10 loads the tokenizer, which encodes text into tokens the model can read, and the fine-tuned model.

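# Load the tokenizer
# (assumption: loaded from the same checkpoint as the model; the source omits this line)
tokenizer = AutoTokenizer.from_pretrained("koheiduck/bert-japanese-finetuned-sentiment")

# Load the model
model = BertForSequenceClassification.from_pretrained("koheiduck/bert-japanese-finetuned-sentiment")


Sentiment analysis is performed by running code-11.

# Build the pipeline and classify a sentence
# (`classifier` is an assumed variable name)
classifier = pipeline("sentiment-analysis", model=model, tokenizer=tokenizer)
print(classifier("I am happy."))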
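A Transformers text-classification pipeline returns a list of dictionaries, each with a `label` and a `score`. The sketch below shows one way such a result might be post-processed. The sample results, the label names, the `summarize` helper, and the 0.8 threshold are all hypothetical and are not output from the model above.

```python
# Hypothetical pipeline output; the label names and scores are illustrative only
sample_results = [
    {"label": "POSITIVE", "score": 0.98},
    {"label": "NEGATIVE", "score": 0.61},
]

def summarize(results, threshold=0.8):
    """Return (label, score, confident) tuples; threshold is an arbitrary cutoff."""
    return [(r["label"], r["score"], r["score"] >= threshold) for r in results]

for label, score, confident in summarize(sample_results):
    print(f"{label}: {score:.2f} (confident: {confident})")
# POSITIVE: 0.98 (confident: True)
# NEGATIVE: 0.61 (confident: False)
```

Treating low-score predictions as uncertain, rather than taking every label at face value, is a simple safeguard when trying the model on your own sentences.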