
Generative adversarial network, AI transforms pictures into comic style

WBOY · 2023-04-11

Hello, everyone.

AI painting has been very popular recently, and I found an open-source project on GitHub to share with you.


The project shared today is implemented with a GAN (Generative Adversarial Network). We have shared many articles on the principles and practice of GANs before; friends who want to know more can read those earlier articles.

The source code and dataset can be obtained at the end of the article. Below, I will share how to train and run the project.

1. Prepare the environment

Install tensorflow-gpu 1.15.0. The GPU used here is a 2080Ti, and the CUDA version is 10.0.
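After installing, you can quickly confirm that the GPU build is being picked up. This is my own minimal check, assuming TensorFlow 1.15 is installed:

import tensorflow as tf

print(tf.__version__)              # should print 1.15.0
print(tf.test.is_gpu_available())  # should print True if CUDA 10.0 and the driver are set up correctly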

Clone the AnimeGANv2 project source code from GitHub with git.

After setting up the environment, you need to prepare the dataset and the pretrained VGG19.


Download the dataset.zip compressed file, which contains 6k real pictures and 2k comic pictures for GAN training.
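A quick way to sanity-check the unpacked data is to count the files. The folder names below are my own assumptions about the archive layout; adjust them to whatever dataset.zip actually unpacks to:

import glob

# Hypothetical paths -- adjust to match the unpacked dataset.zip layout.
real_photos = glob.glob('dataset/train_photo/*.jpg')
comic_images = glob.glob('dataset/Hayao/style/*.jpg')

print(len(real_photos), 'real photos')    # roughly 6k expected
print(len(comic_images), 'comic images')  # roughly 2k expected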


VGG19 is used to calculate the loss; this will be introduced in detail below.

2. Network model

A generative adversarial network requires two models to be defined: a generator and a discriminator.

The generator network is defined as follows:

# blocks A and B downsample the input with stride-2 convolutions
with tf.variable_scope('A'):
    inputs = Conv2DNormLReLU(inputs, 32, 7)
    inputs = Conv2DNormLReLU(inputs, 64, strides=2)
    inputs = Conv2DNormLReLU(inputs, 64)

with tf.variable_scope('B'):
    inputs = Conv2DNormLReLU(inputs, 128, strides=2)
    inputs = Conv2DNormLReLU(inputs, 128)

# block C stacks the inverted residual blocks
with tf.variable_scope('C'):
    inputs = Conv2DNormLReLU(inputs, 128)
    inputs = self.InvertedRes_block(inputs, 2, 256, 1, 'r1')
    inputs = self.InvertedRes_block(inputs, 2, 256, 1, 'r2')
    inputs = self.InvertedRes_block(inputs, 2, 256, 1, 'r3')
    inputs = self.InvertedRes_block(inputs, 2, 256, 1, 'r4')
    inputs = Conv2DNormLReLU(inputs, 128)

# blocks D and E upsample back toward the input resolution
with tf.variable_scope('D'):
    inputs = Unsample(inputs, 128)
    inputs = Conv2DNormLReLU(inputs, 128)

with tf.variable_scope('E'):
    inputs = Unsample(inputs, 64)
    inputs = Conv2DNormLReLU(inputs, 64)
    inputs = Conv2DNormLReLU(inputs, 32, 7)

with tf.variable_scope('out_layer'):
    # 1x1 convolution down to 3 channels; tanh keeps the output in [-1, 1]
    out = Conv2D(inputs, filters=3, kernel_size=1, strides=1)
    self.fake = tf.tanh(out)
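Since the generator ends with tf.tanh, the fake images live in [-1, 1]. The snippet below is my own post-processing sketch (not part of the project code) for turning such a batch back into displayable uint8 images:

import numpy as np

def to_uint8(fake_batch):
    # map [-1, 1] -> [0, 255] and clip, so the result can be saved as a normal image
    return ((np.asarray(fake_batch) + 1.0) / 2.0 * 255.0).clip(0, 255).astype('uint8')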

The main module in the generator is the inverted residual block; a simplified sketch of this structure is given after the figure below.

The standard residual block (a) and the inverted residual block (b)
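The InvertedRes_block used in block C follows the MobileNetV2-style idea: expand the channels with a 1x1 convolution, apply a cheap depthwise 3x3 convolution, then project back with a linear 1x1 convolution, adding a skip connection when the shapes match. The code below is my own simplified illustration of that idea, not the project's exact implementation:

import tensorflow as tf

def inverted_res_block_sketch(x, expansion=2, out_ch=256, name='irb'):
    in_ch = x.get_shape().as_list()[-1]
    with tf.variable_scope(name):
        # 1x1 expansion
        h = tf.layers.conv2d(x, in_ch * expansion, 1, padding='same', activation=tf.nn.leaky_relu)
        # depthwise-separable 3x3 convolution
        h = tf.layers.separable_conv2d(h, in_ch * expansion, 3, padding='same', activation=tf.nn.leaky_relu)
        # linear 1x1 projection (no activation)
        h = tf.layers.conv2d(h, out_ch, 1, padding='same')
        # skip connection only when input and output shapes match
        if in_ch == out_ch:
            h = h + x
    return h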

The discriminator network structure is as follows:

def D_net(x_init, ch, n_dis, sn, scope, reuse):
    channel = ch // 2
    with tf.variable_scope(scope, reuse=reuse):
        x = conv(x_init, channel, kernel=3, stride=1, pad=1, use_bias=False, sn=sn, scope='conv_0')
        x = lrelu(x, 0.2)

        for i in range(1, n_dis):
            # stride-2 convolution halves the spatial resolution
            x = conv(x, channel * 2, kernel=3, stride=2, pad=1, use_bias=False, sn=sn, scope='conv_s2_' + str(i))
            x = lrelu(x, 0.2)

            x = conv(x, channel * 4, kernel=3, stride=1, pad=1, use_bias=False, sn=sn, scope='conv_s1_' + str(i))
            x = layer_norm(x, scope='1_norm_' + str(i))
            x = lrelu(x, 0.2)

            channel = channel * 2

        x = conv(x, channel * 2, kernel=3, stride=1, pad=1, use_bias=False, sn=sn, scope='last_conv')
        x = layer_norm(x, scope='2_ins_norm')
        x = lrelu(x, 0.2)

        # single-channel logit map used for the real/fake decision
        x = conv(x, channels=1, kernel=3, stride=1, pad=1, use_bias=False, sn=sn, scope='D_logit')

        return x

3. Loss

Before calculating the loss, the VGG19 network is used to vectorize the images. This process is a bit like the Embedding operation in NLP.

Embedding converts words into vectors; VGG19 converts pictures into vectors (feature maps).
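To make the analogy concrete, here is a small sketch of my own, using the stock Keras VGG19 rather than the project's own Vgg19 class, that turns an image into a feature map which can then be compared in the losses (block4_conv4 roughly corresponds to the conv4_4 features used by the project):

import numpy as np
import tensorflow as tf

base = tf.keras.applications.VGG19(include_top=False, weights='imagenet')
extractor = tf.keras.Model(base.input, base.get_layer('block4_conv4').output)

image = np.random.rand(1, 256, 256, 3).astype('float32')  # stand-in for a preprocessed photo
features = extractor.predict(image)
print(features.shape)  # (1, 32, 32, 512) -- the image "vector" (feature map) used by the losses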

VGG19 network definition

The logic of the loss calculation is as follows:

def con_sty_loss(vgg, real, anime, fake):

    # vectorize the real photo
    vgg.build(real)
    real_feature_map = vgg.conv4_4_no_activation

    # vectorize the generated image
    vgg.build(fake)
    fake_feature_map = vgg.conv4_4_no_activation

    # vectorize the comic-style images
    vgg.build(anime[:fake_feature_map.shape[0]])
    anime_feature_map = vgg.conv4_4_no_activation

    # content loss between the real photo and the generated image
    c_loss = L1_loss(real_feature_map, fake_feature_map)
    # style loss between the comic style and the generated image
    s_loss = style_loss(anime_feature_map, fake_feature_map)

    return c_loss, s_loss

Here VGG19 is used to compute the content loss between the real photo (parameter real) and the generated image (parameter fake), and the style loss between the generated image (parameter fake) and the comic style (parameter anime).

# content loss and style loss
c_loss, s_loss = con_sty_loss(self.vgg, self.real, self.anime_gray, self.generated)
# weighted sum of content, style, color and total-variation losses
t_loss = self.con_weight * c_loss + self.sty_weight * s_loss + color_loss(self.real, self.generated) * self.color_weight + tv_loss

Finally, different weights are given to these losses, so that the images produced by the generator not only retain the content of the real photos but also take on the comic style.
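The style_loss used above is not shown in the snippet. A common formulation, and a reasonable assumption for what it does here, is an L1 distance between Gram matrices of the VGG feature maps; this is my own minimal sketch:

import tensorflow as tf

def gram(feature_map):
    # flatten spatial dims: (B, H, W, C) -> (B, H*W, C)
    shape = tf.shape(feature_map)
    b, hw, c = shape[0], shape[1] * shape[2], shape[3]
    flat = tf.reshape(feature_map, [b, hw, c])
    # channel-to-channel correlations, normalized by the number of entries
    return tf.matmul(flat, flat, transpose_a=True) / tf.cast(hw * c, tf.float32)

def style_loss_sketch(anime_feature_map, fake_feature_map):
    # L1 distance between the Gram matrices of the two feature maps
    return tf.reduce_mean(tf.abs(gram(anime_feature_map) - gram(fake_feature_map)))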

4. Training

Execute the following command in the project directory to start training:

python train.py --dataset Hayao --epoch 101 --init_epoch 10

After training starts successfully, you can see the training output.


At the same time, you can see the losses decreasing.

The source code and dataset have been packaged; if you need them, just leave a message in the comment area.

If you find this article useful, please give it a like to encourage me. I will continue to share excellent Python AI projects in the future.

