Mingmin Xiaoxiao from Aufei Temple
Qubit Report丨Public Account QbitAI
Finally, I A childhood dream has come true!
Just need me to take a picture of mine Handwriting, AI can help me transcribe English homework, the kind of "exactly the same" painting style:
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/6a0711f0-1266-47d9-9bfa-980e4ca32651.jpg)
I didn’t even copy homework for others Question...
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/94fb963c-db36-4220-b2c4-9176592b0324.jpg)
Simply hanging a batch can only An "artifact of homework" that imitates handwriting and costs hundreds or thousands of dollars.
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/847182c8-7ac8-4ae3-ac70-c33b42770d17.jpg)
Ahem, focus:
Although the function is very powerful , but this is not for copying English homework for you. (You have to do your homework seriously!)
This is< /span>Facebook AIThe latest "Text Style Brush"(TextStyleBrush), it only needs a photo of handwriting to perfectly restore a whole set of text handwriting.
Not only can move flowers and plants, but also " "soy sauce bottle" becomes "teapot":
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/f35cdf11-7ac0-451e-8269-51db39f50deb.jpg)
It can also be implemented directlyStyle replacement, so that all printed words in the fruit and vegetable shop become handwritten :
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/fc171aa5-a5e0-43d1-8102-c9d7f5aa2bae.jpg)
In this way, even now Photo text is not necessarily real anymore.
Stronger than format brush: text can also be changed
In actual use, TextStyleBrush is really a Format brush, wherever you need to brush.
Its real power is to simulate handwriting font.
Just enter some text, Add your handwriting, just 1 word, and it can generate a "handwritten version".
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/3a8ccd51-bbb2-4b52-84ef-1f7fcaf5cb52.jpg)
This effect can be seen with naked eyes What's more, you can't tell the truth from the fake!
Print the price tag in the market In the process of changing all fonts into handwritten, it can also recognize samples that are not printed, and automatically skip the conversion and synthesis.
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/63d87c16-ba3c-449c-a165-b9ae739a52b2.jpg)
△The two handwritten labels have not been changed
When simulating a specific font format, TextStyleBrush also performs very well.
Including posters, trash cans, street signs , beverage bottles, storefront decorations... various text styles can be handled:
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/3780317c-e05e-4493-848b-c1e3b7accb51.jpg)
In addition to intuitive effects, developers Data analysis was also done on the synthesized images.
The image generated by TextStyleBrush is synthetically wrong (MSE) is greatly reduced, and the peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) are also improved a lot.
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/c852dc56-575d-4135-880e-06cd1c696a12.jpg)
On the accuracy of text recognition , TextStyleBrush performed well in the three sets of data sets:
The accuracy rate is as high as95% above.
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/32ce84ee-355d-47ce-a06e-da451d9def52.jpg)
Using GAN to change it, it is difficult to distinguish true and false text
According to Facebook, TextStyleBrush is a text style brush based on< span style="color: #00997F; --tt-darkmode-color: #00997F;">Self-supervisedThe model trained by the method can perform style conversion on text with the same text content, just like The format brush is the same.
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/e82166d4-0e80-4518-88b2-67f8b9094c79.jpg)
Of course, not just the format of Word Brush, it can even directly replace the text in the photo, so the model also needs to learntext Methods for identifying and image segmentation.
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/0117b138-f6ee-46fa-8668-6e9ea7b31285.jpg)
△Backlight scene is also easy
In order to achieve image segmentation and text style conversion at the same time, the TextStyleBrush model is based onStyleGAN2 designed to generate very photorealistic images.
However, there are two problems with StyleGAN2 :
- First of all, the way it generates images is "randomly hitting", that is, there is no way to control the characteristics of the output image. But TextStyleBrush must generate an image of specified text.
- Secondly, the overall style of StyleGAN2 Uncontrolled, but the style in TextStyleBrush involves a large amount of information combination, including features such as color, scale and style conversion, and even handwriting details with personal characteristics.
To this end, TextStyleBrush firstly controls the output of the model by using text information and style as two "additional conditions" to solve the problem of random image generation by the model.
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/e6ee55eb-dd34-436b-8e54-afa8355268b1.jpg)
Then, to further refine To control the style characteristics of the text, various style information in the neural network layer will also be extracted, and these information will be injected into the text generator, so as to control the style of the text from various scales (color, overall style, details).
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/1cab1b0a-c775-4089-b55c-8b7d7f8ebd81.jpg)
Besides, due to different The image resolution of the region is different, and the generator must also generate text with a similar resolution to the replacement region.
To this end, the model is added In order to control the structure of high and low resolution, the generated text image can match the resolution of the input image.
Like this, before and after replacement There will be no problems with large differences in font clarity:
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/203fc0d3-150d-4db5-97cf-93d3b1aeb26e.jpg)
But unlike photos, text The style is actually morefree, so sometimes the painting style The authenticity is hard to say.
For this, when training , Facebook introduced an innovative self-supervised training method, combining three models of style classification, text recognition (OCR) and GAN to preserve the input style/text content, and then decide which one to replace.
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/9441186f-2cb8-49fc-bbb0-2599758288c3.jpg)
For example, in text recognition, After letting TextStyleBrush generate a text image, the model will use a pre-trained text recognition structure to "judge" the text content of the image and give it a score.
It turns out that training like this The model is really useful.
Netizen: Disgusting the real? I'm really worried...
Synthesizing human faces has been played too much, but it is the first time to synthesize handwriting.
And it really works good!
So, once the TextStyleBrush is released, it will It attracted many people to watch.
Some netizens have already begun to imagine it Used for:
Welcome to< /span>The world of fancy signature!
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/b94ff40c-8699-40dc-a0cd-4dda17ac4383.jpg)
LeCun also forwarded a wave.
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/8d707760-75a6-4fce-ba19-a3bb0525450d.jpg)
However, it is really hard to see or not to play It was too uncomfortable, and netizens with itchy hands came to ask questions:
TextStyleBrush will be public Is it open for use?
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/0ba28908-e244-4456-bdb8-3d195a949400.jpg)
This naturally leads to a Points that will cause controversy:
The handwriting after synthesis is enough to look like real ones, What if it is misused or used maliciously?
Assume that any one's handwriting is It can be synthesized very easily, so what should we do in many occasions that need to be signed?
For example, some netizens said that if Even doctors imitate their "cursive" prescriptions… …
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/0ab83f95-13db-481a-9cc9-fef942cdfbd7.jpg)
In addition to security and privacy issues Worry, it's not a problem for type designers good news.
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/e6b5c1c8-8d62-4af0-87fb-20c77e5f7e17.jpg)
After all, all fonts are actually If there is copyright, if it can be easily simulated, wouldn't it bepiracy< /strong>Flying all over the sky, even the author himself can't tell the truth from the fake.
Some netizens said: This is far from The dystopian world where real and fake are a little bit closer...
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/8157862b-031e-4865-99cf-26d2b8b74459.jpg)
In this regard, Facebook's CTO made responded:
Because it may be It is used to forge handwriting, so we only publish papers and data sets, and the source code will not be open source .
share research and datasets, and More to prevent text versions of Deepfakes.
![](http://daogezhiyuan-article.oss-cn-beijing.aliyuncs.com/win3000/pic/16fecc26-4a5f-4b6b-94c1-96a045b86665.jpg)
What do you think?
TextStyleBrush dataset:
https://github.com/facebookresearch/IMGUR5K-Handwriting-Dataset
Paper URL:
https://scontent-fml2-1.xx.fbcdn.net/v/t39.8562-6/ 10000000_944085403038430_3779849959048683283_n.pdf
— END —
Qubit QbitAI Toutiao signed a contract
Follow us, the first time to learn about cutting-edge technology trends
Articles are uploaded by users and are for non-commercial browsing only. Posted by: Lomu, please indicate the source: https://www.daogebangong.com/en/articles/detail/AI%20high%20imitation%20of%20your%20handwriting%20only%20needs%201%20word%20Deepfake%20text%20version%20is%20here%20netizens%20fake%20ones.html
评论列表(196条)
测试