File:Robot hand trained with human feedback 'pretends' to grasp ball.ogg

No higher resolution available.

Robot_hand_trained_with_human_feedback_'pretends'_to_grasp_ball.ogg ‎(Ogg Theora video file, length 4.2 s, 320 × 320 pixels, 205 kbps, file size: 106 KB)

Summary edit

Media data and Non-free use rationale
Description	An AI system learns to pretend to grasp an object by placing the hand between the camera and the object. So it receives positive feedback from its user.
Author or copyright owner	Dario Amodei, Paul Christiano, Alex Ray
Source (WP:NFCC#4)	Original publication: Where: https://openai.com/blog/deep-reinforcement-learning-from-human-preferences/ When: 21 December 2016 How: As part of a blog post Immediate source: https://openai.com/content/images/2017/06/gifhandlerresized.gif
Date of publication	13 June 2017
Use in article (WP:NFCC#7)	AI alignment
Purpose of use in article (WP:NFCC#8)	This GIF illustrates what happens when an AI system is trained by human feedback. The system learns to fool the human into giving positive feedback. The fallibility of human feedback is a central problem in scalable supervision.
Not replaceable with free media because (WP:NFCC#1)	Other examples of unintended AI behavior are not from AI systems trained with human feedback. This is because human feedback is not widely used yet. Furthermore, other examples also do not have a free use license either. I have gone through the largest list of examples to confirm this: https://docs.google.com/spreadsheets/d/e/2PACX-1vRPiprOaC3HsCf5Tuum8bRfzYUiKLRqJmbOoC-32JorNdfyTiRRsR7Ea5eWtvsWzuxo8bjOxCG84dAg/pubhtml The authors of such examples do not seem to be interested in attaching a free-use license to their video uploads. A replacement cannot be created on purpose because unintended AI behavior is unintended - i.e. not on purpose.
Minimal use (WP:NFCC#3)	The file will be used in only one article. It shows a screenshot clip of only a few seconds.
Respect for commercial opportunities (WP:NFCC#2)	The content was created by OpenAI Nonprofit. This is a research blog post from a research organization. The content is not related to any commercial product. It was released as part of a blog post by the authors who wanted to illustrate the dangers of training AI by human feedback.
Fair useFair use of copyrighted material in the context of AI alignment//en.wikipedia.org/wiki/File:Robot_hand_trained_with_human_feedback_%27pretends%27_to_grasp_ball.oggtrue

Licensing edit

This is a sample from a copyrighted video recording. The person who uploaded this work and first used it in an article, and subsequent people who use it in articles, assert that this qualifies as fair use under United States copyright law when used on the English-language Wikipedia, hosted on servers in the United States by the non-profit Wikimedia Foundation, where:

The sample is being used for commentary on the video recording in question, and contributes significantly to the encyclopedia articles it is used in (listed under the heading "File links" below) in a way that cannot be duplicated by other forms of media.
The sample is short in relation to the duration of the recorded track and is of an inferior quality to the original recording.
There is no adequate free alternative available.
No other samples from the same recording are currently used in Wikipedia;
A more detailed fair use rationale may be provided by the user who uploaded this recording.

Any other uses of this recording, on Wikipedia or elsewhere, may be copyright infringement. If you are the copyright holder of this recording and you feel that its use here does not fall under "fair use" please see Wikipedia:Copyright problems for information on how to proceed.
Fair use

To the uploader:

Please add a detailed non-free use rationale for each article the image is used in, which must also declare compliance with the other parts of the non-free content criteria, as well as the source of the work and copyright information.
For example non-free use rationales, see Wikipedia:Use rationale examples.
This tag should only be used for video extracts, do not use it for other purposes.

To patrollers and administrators: If this image has an appropriate rationale please append |image has rationale=yes as a parameter to the license template.

File history

Click on a date/time to view the file as it appeared at that time.

	Date/Time	Thumbnail	Dimensions	User	Comment
current	12:05, 9 September 2022		4.2 s, 320 × 320 (106 KB)	SoerenMind (talk \| contribs)	Uploading a non-free file using File Upload Wizard

You cannot overwrite this file.

File usage

The following pages on the English Wikipedia use this file (pages on other projects are not listed):

AI alignment

Transcode status

Update transcode status

Format	Bitrate	Status	Encode time
VP9 240P	32 kbps	Completed 18:47, 18 February 2024	2.0 s
Streaming 240p (VP9)	32 kbps	Completed 18:18, 29 February 2024	2.0 s
WebM 360P	61 kbps	Completed 07:28, 31 October 2023	1.0 s
Streaming 144p (MJPEG)	183 kbps	Completed 07:29, 31 October 2023	1.0 s

Metadata

This file contains additional information, probably added from the digital camera or scanner used to create or digitize it.

If the file has been modified from its original state, some details may not fully reflect the modified file.

Software used	Lavf59.29.100 Lavc59.39.100 libtheora