
2025/06/12 14:01:36

By Mingmin, reporting from Aofeisi
QbitAI (Quantum Bits) | Official account QbitAI

DALL-E 2, which became popular worldwide for its superb image generation, has been called into question.

For example, the polysemous word "bat" stumped it.

The prompt was "a bat is flying over a baseball stadium", where "bat" could mean either the animal or a baseball bat.

As a result, in the pictures it draws, both a bat and a baseball bat are flying through the sky.

And this is not an accidental mistake. If you enter "a person is hearing a bat", both the animal and the baseball bat are drawn.

In another case, the prompt is "a fish and a gold ingot".

Sure enough, it casts both things in gold, turning the fish into a literal gold fish.

These mistakes should not be underestimated, because they suggest that DALL-E 2 has a problem with the basic mapping between symbols and entities in language when it generates images from text.

That mapping says one word corresponds to one entity.

Take "bat" as an example. Drawing either the animal or the baseball bat counts as a correct understanding by DALL-E 2, but drawing both is a problem.

It is like a single-choice question: answering A or B is correct, but writing down both breaks the rules.

What's more, it sometimes attaches a modifier to the wrong object, as if "the solution to the previous question is used on the next one."

The problem was discovered by scholars from Bar-Ilan University and the Allen Institute for Artificial Intelligence, who wrote a paper analyzing it in detail.

Interestingly, researcher Yoav Goldberg also mentioned that this behavior is not common in DALL-E mini and Stable Diffusion.

He speculates that this may be due to the so-called inverse scaling phenomenon.

Put simply: "the larger the model, the worse the performance."

What exactly does the paper say?

After discovering the problem, the researchers ran repeated experiments and grouped it into three cases:

First, a word is interpreted as two different things
Second, a word is interpreted as a modifier of two different things
Third, a word is interpreted as one thing and, at the same time, as a modifier of another thing

The first two situations have been mentioned at the beginning.

As for the third, if you enter "a zebra and a street", a zebra crossing always appears in the output.

Here, DALL-E 2 interprets "zebra" twice at the same time.

After repeated experiments on these cases, the authors calculated that DALL-E 2's error rate is above 80% in all three.

The second case has the highest error rate, reaching 97.2%.
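As a rough illustration of how per-case error rates like these can be tallied, here is a minimal Python sketch. It assumes each generated image has been manually labeled as showing the duplicated interpretation or not. The function name `error_rates`, the case labels, and the toy labels are all hypothetical; they are not the paper's code or data.

```python
from collections import defaultdict


def error_rates(annotations):
    """Return the fraction of images labeled as errors for each case.

    annotations: iterable of (case, is_error) pairs, one per generated image.
    """
    totals = defaultdict(int)
    errors = defaultdict(int)
    for case, is_error in annotations:
        totals[case] += 1
        if is_error:
            errors[case] += 1
    return {case: errors[case] / totals[case] for case in totals}


# Toy labels invented for illustration only (NOT the paper's data).
toy_annotations = [
    ("homonym", True), ("homonym", True), ("homonym", False), ("homonym", True),
    ("shared_modifier", True), ("shared_modifier", True),
]

rates = error_rates(toy_annotations)
for case, rate in rates.items():
    print(f"{case}: {rate:.1%}")
```

In practice, many images would be sampled for each prompt so that the rate reflects consistent behavior and not a single unlucky draw.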

In the third case, adding a new modifier to the other noun avoids the mistake.

That is, with the prompt "a zebra and a gravel road", no zebra crossing appears on the road surface.

These duplicated interpretations are also uncommon when using DALL-E mini and Stable Diffusion.

The authors explained that future work could study the model's text encoder to trace these problems, and could examine whether they are related to model size and architecture.

Yoav Goldberg, one of the authors of the paper, is an outstanding professor at Bar-Ilan University and director of research at the Israel branch of the Allen Institute for Artificial Intelligence.

Before that, he worked as a postdoctoral researcher at Google Research in New York. His research interests are NLP and machine learning, especially syntactic parsing.

DALL-E 2 was also found to have invented its own language

Just a few months earlier, a computer science doctoral student discovered that feeding DALL-E 2 certain strange words also produces images of a consistent type,

and that these words come from images DALL-E 2 itself generated.

For example, after entering "Two farmers talking about vegetables, with subtitles", some "garbled" words appear in the image DALL-E 2 returns.

And if the new word "Vicootes" from the image is fed back to the model as a prompt, a batch of images unexpectedly appears:

There are radishes, pumpkins, and small persimmons... Could "Vicootes" mean vegetables?

If you then feed DALL-E 2 the string "Apoploe vesrreaitiis" from the speech bubble above, a batch of bird pictures appears:

"Could it be that this word means 'bird', so the farmers seem to be talking about the birds that affect their vegetables?"

At the time, the doctoral student's post about the discovery immediately sparked heated discussion online.

Some people tried to analyze how DALL-E 2 encrypts language, while others thought it was just noise.

But in general, when it comes to language understanding, DALL-E 2 can always come up with something unexpected.

What do you think is the reason behind this?

Paper address:
https://arxiv.org/pdf/2210.10606.pdf

Reference link:
https://twitter.com/yoavgo/status/1583088957226881025

— End —

Quantum bit QbitAI · Toutiao account signing
