US Patent No. 10,769,495


Patent No. 10,769,495
Issue Date September 08, 2020
Title Collecting Multimodal Image Editing Requests
Inventorship Trung Huu Bui, San Jose, CA (US)
Zhe Lin, Fremont, CA (US)
Walter Wei-Tuh Chang, San Jose, CA (US)
Nham Van Le, San Jose, CA (US)
Franck Dernoncourt, Sunnyvale, CA (US)
Assignee Adobe Inc., San Jose, CA (US)

Claim of US Patent No. 10,769,495

1. In a digital medium environment to create multimodal image editing requests, a method implemented by a computing device, the method comprising:
displaying a pair of images including a first image and a second image that includes an edit of the first image;
presenting an option to skip the pair of images, the option including that the first image and the second image are too similar;
recording a multimodal image editing request that includes a user gesture and a voice command that describe the edit of the first image;
receiving a user transcription of the voice command;
generating a data object in a searchable format, the data object including the voice command, the user gesture, and the user transcription; and
training a neural network to recognize the multimodal image editing requests using the data object and the first image as inputs of the neural network and the second image as an output of the neural network.
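
As an illustration only, the data object recited in the claim could be sketched as a small JSON-serializable record. The claim does not specify a schema, a file format, or a network architecture, so every name below (EditRequestRecord, its fields, search, training_pair) is a hypothetical assumption, JSON stands in for the "searchable format", and the training step is reduced to assembling one input/output pair rather than training an actual neural network.

```python
import json
from dataclasses import dataclass, asdict
from typing import Dict, List, Tuple


@dataclass
class EditRequestRecord:
    """Hypothetical data object for one multimodal image editing request."""
    first_image: str                         # identifier of the original image
    second_image: str                        # identifier of the edited image
    voice_command: str                       # path to the recorded audio (assumed)
    user_gesture: List[Tuple[float, float]]  # traced (x, y) points (assumed encoding)
    user_transcription: str                  # user's transcription of the voice command

    def to_json(self) -> str:
        # JSON is an assumed choice of searchable format, not one named in the claim.
        return json.dumps(asdict(self), sort_keys=True)


def search(records: List[EditRequestRecord], keyword: str) -> List[EditRequestRecord]:
    """Return the records whose transcription contains the keyword (case-insensitive)."""
    needle = keyword.lower()
    return [r for r in records if needle in r.user_transcription.lower()]


def training_pair(record: EditRequestRecord) -> Tuple[Dict[str, str], str]:
    """Assemble (inputs, output) as the claim describes: the data object and the
    first image are inputs, and the second image is the output."""
    inputs = {"data_object": record.to_json(), "first_image": record.first_image}
    return inputs, record.second_image


records = [
    EditRequestRecord("img_001.png", "img_001_edit.png", "cmd_001.wav",
                      [(0.2, 0.3), (0.4, 0.5)], "Make the sky brighter"),
    EditRequestRecord("img_002.png", "img_002_edit.png", "cmd_002.wav",
                      [(0.7, 0.1)], "Remove the lamp post"),
]
matches = search(records, "sky")
inputs, target = training_pair(records[0])
```

In this sketch, search(records, "sky") selects only the first record, and training_pair returns the serialized record together with the first image as inputs and the second image as the target. A real pipeline would load the image and audio data rather than pass identifiers.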