US Pat. No. 10,531,150

METHOD AND SYSTEM FOR DETECTING AND RESPONDING TO CHANGING OF MEDIA CHANNEL

Gracenote, Inc., Emeryvi...

1. A method of detecting and responding to a change of channels at a media presentation device, the method comprising:
while the media presentation device is presenting media content that was determined to be of a first channel, detecting a threshold mismatch between (i) a query fingerprint stream representing the media content being presented by the media presentation device and (ii) a reference fingerprint stream representing media content of the first channel, wherein the media content being presented by the media presentation device was determined to be of the first channel by comparing the query fingerprint stream with the reference fingerprint stream using a first fingerprint comparison rate and, based on the comparing, detecting a match between the query fingerprint stream and the reference fingerprint stream, wherein detecting the threshold mismatch is done by comparing the query fingerprint stream with the reference fingerprint stream using a second fingerprint comparison rate greater than the first fingerprint comparison rate, and wherein the threshold mismatch indicates a likelihood that the media presentation device has transitioned from presenting the first channel to presenting a second channel;
responsive to detecting the threshold mismatch, changing an overlay on the presented media content, wherein changing the overlay involves removing superimposed content associated with the first channel.
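
The two comparison rates recited above can be illustrated with a minimal sketch. It assumes fingerprints are small integers compared by Hamming distance and models a "comparison rate" as how many fingerprint pairs are sampled (every `step`-th pair). The names `mismatch_fraction`, `channel_changed`, `max_bit_errors`, and the threshold values are hypothetical and do not come from the patent.

```python
from typing import Sequence


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two integer fingerprints."""
    return bin(a ^ b).count("1")


def mismatch_fraction(query: Sequence[int], reference: Sequence[int],
                      step: int, max_bit_errors: int = 2) -> float:
    """Compare every `step`-th fingerprint pair and return the fraction that mismatch.

    A smaller `step` means more comparisons per unit of stream, i.e. a higher
    comparison rate (an assumption made for this illustration).
    """
    pairs = list(zip(query, reference))[::step]
    if not pairs:
        return 0.0
    misses = sum(1 for q, r in pairs if hamming(q, r) > max_bit_errors)
    return misses / len(pairs)


def channel_changed(query: Sequence[int], reference: Sequence[int],
                    fast_step: int = 1, threshold: float = 0.5) -> bool:
    """Monitor at the higher rate and flag a threshold mismatch."""
    return mismatch_fraction(query, reference, fast_step) >= threshold


# Identification might sample sparsely (step=4); monitoring samples densely (step=1).
reference = [0xF0] * 8
still_tuned = [0xF0] * 8
switched = [0xF0] * 2 + [0x0F] * 6
print(channel_changed(still_tuned, reference), channel_changed(switched, reference))
```

In this sketch a caller would remove the first channel's overlay once `channel_changed` returns True.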

US Pat. No. 10,893,320

DYNAMIC VIDEO OVERLAYS

Gracenote, Inc., Emeryvi...

1. A system, comprising:
a display;
memory that stores instructions and a plurality of templates; and
one or more processors of a client device configured by the instructions to perform operations comprising:
accessing a video input stream that includes first video content from a content provider and second video content generated by a set-top box device upstream of the one or more processors, wherein, in a frame of the video input stream, the first video content corresponds to a first screen portion of the display and the second video content corresponds to a second screen portion of the display;
generating, from the first video content, a query fingerprint by a query fingerprint generator of the client device according to an image of the video input stream;
accessing, based on a comparison of the query fingerprint and a reference fingerprint generated by a reference fingerprint generator, replacement video content provided by a replacement content source, wherein the replacement video content differs from the first video content, wherein the replacement video content is accessed separately from the video input stream;
identifying, based on the set-top box device, a template of the plurality of templates, wherein the identified template indicates that the second screen portion comprises an overlay for displaying the second video content generated by the set-top box device;
comparing the video input stream to the identified template to determine that the frame of the video input stream corresponds to the identified template;
responsive to a determination that the frame of the video input stream corresponds to the identified template, generating a video output stream comprising the second video content for the second screen portion and the video replacement content for the first screen portion to mimic a presentation of the first video content and the second video content in the video input stream; and
causing the video output stream to be presented on the display, wherein comparing the video input stream to the identified template comprises:
downsampling the frame of the video input stream to form a downsampled frame,
determining a cross-correlation between the downsampled frame and the identified template,
performing a comparison of the cross-correlation to a threshold, and
determining, based on the comparison, that the frame of the video input stream corresponds to the identified template.
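
The template comparison in the final four steps (downsample, cross-correlate, compare to a threshold) can be sketched as follows. This is an illustration only: it assumes grayscale frames stored as lists of rows, block-average downsampling, and a zero-lag normalized cross-correlation. The function names, the `factor` parameter, and the 0.9 threshold are assumptions, not values from the patent.

```python
from math import sqrt
from typing import List

Frame = List[List[float]]


def downsample(frame: Frame, factor: int) -> Frame:
    """Average each factor x factor block of a grayscale frame."""
    rows, cols = len(frame) // factor, len(frame[0]) // factor
    return [
        [
            sum(frame[r * factor + i][c * factor + j]
                for i in range(factor) for j in range(factor)) / factor ** 2
            for c in range(cols)
        ]
        for r in range(rows)
    ]


def normalized_cross_correlation(a: Frame, b: Frame) -> float:
    """Zero-lag normalized cross-correlation of two equally sized frames."""
    xs = [v for row in a for v in row]
    ys = [v for row in b for v in row]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))
    return num / den if den else 0.0


def frame_matches_template(frame: Frame, template: Frame,
                           factor: int = 2, threshold: float = 0.9) -> bool:
    """Downsample the frame, correlate with the template, compare to a threshold."""
    small = downsample(frame, factor)
    return normalized_cross_correlation(small, template) >= threshold


# A frame whose lower half is bright (e.g. a set-top box banner) against a
# template that marks the lower half as the overlay region.
frame = [[0, 0, 0, 0], [0, 0, 0, 0], [200, 200, 200, 200], [200, 200, 200, 200]]
template = [[0, 0], [1, 1]]
print(frame_matches_template(frame, template))
```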

US Pat. No. 10,742,702

SAVING MEDIA FOR AUDIO PLAYOUT

Gracenote Digital Venture...

1. A computer-implemented method comprising:
receiving, by a profile server device and from a client device, an indication of a text-based article that has been flagged for later audible playout by way of the client device, wherein the client device is associated with a user profile;
in response to the text-based article having been flagged for later audible playout, causing, by the profile server device, the text-based article to be converted into an audio file;
adding, by the profile server device, the audio file to a playlist associated with the user profile;
receiving, by the profile server device and from a playout device, a request for the playlist, wherein the request includes user-distinguishing information;
determining, by the profile server device, that the user profile is associated with the user-distinguishing information; and
in response to: (i) the request for the playlist and (ii) the user profile being associated with the user-distinguishing information, causing, by the profile server device, the playlist to be transmitted to the playout device.

US Pat. No. 10,701,410

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICE

Gracenote, Inc., Emeryvi...

1. A method for use in connection with a client device that is configured to present media content sequences, the method comprising:
receiving, by the client device, one or more reference fingerprints, wherein the one or more reference fingerprints is associated with channel data for a particular channel;
receiving, by the client device, one or more media content sequences;
generating, by the client device, one or more fingerprints of the received one or more media content sequences;
detecting a match between (i) the received one or more reference fingerprints and (ii) the generated one or more fingerprints; and
responsive to detecting the match, sending, by the client device, to one or more server devices, a message that comprises the channel data associated with the particular channel, wherein the particular channel is different from a former channel that the client device previously determined that it was tuned to based on one or more additional reference fingerprints received by the client device.

US Pat. No. 10,523,999

MEDIA CHANNEL IDENTIFICATION AND ACTION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON MATCHING WITH DIFFERENTIAL REFERENCE-FINGERPRINT FEATURE

Gracenote, Inc., Emeryvi...

1. A media presentation device comprising:
a media input interface through which to receive video content to be presented by the media presentation device, wherein the video content includes video frames having video frame regions, the video frame regions comprising a center, an edge, and a corner;
a media presentation interface for presenting the received video content;
a network communication interface;
a processing unit;
non-transitory data storage; and
program instructions stored in the non-transitory data storage and executable by the processing unit to carry out operations comprising:
generating a first query fingerprint of the video content based on analysis of the video content, and outputting the generated first query fingerprint for transmission through the network communication interface to a server,
receiving from the server, in response to the first query fingerprint, a request for a second query fingerprint specifically focused on an identified video frame region of the video frame regions of the video content, wherein the server determines that the first query fingerprint matches multiple reference fingerprints corresponding with multiple channels and, in response to the determining, identifies the video frame region, the identifying being based on the video frame region defining a difference between the channels corresponding with the multiple reference fingerprints,
outputting, for transmission through the network communication interface to the server, the requested second query fingerprint specifically focused on the identified video frame region of the video content, and
causing the media presentation device to present, in conjunction with the video content that the media presentation device is presenting, supplemental channel-specific content associated with one of the multiple channels, wherein the server identifies the one channel by determining that the second query fingerprint matches a reference fingerprint of the video frame region of just the one channel of the multiple channels, and wherein the media presentation device receives from the server the supplemental channel-specific content for the media presentation device to present in conjunction with the video content that the media presentation device is presenting.

US Pat. No. 10,869,095

METHOD AND SYSTEM FOR PRESENTING ADDITIONAL CONTENT AT A MEDIA SYSTEM

Gracenote, Inc., Emeryvi...

1. A method comprising:
receiving, by a computing system, a first query fingerprint for comparison with a plurality of reference fingerprints, wherein the first query fingerprint represents media content being received by a client device;
identifying, by the computing system, a match between the first query fingerprint and a first reference fingerprint of the plurality of reference fingerprints;
determining, by the computing system, that a distinctiveness of the media content represented by the first query fingerprint is below a threshold distinctiveness;
responsive to identifying the match and determining that the distinctiveness of the media content represented by the first query fingerprint is below the threshold distinctiveness, (i) incrementing, by the computing system, a match counter and (ii) performing, by the computing system, a match verification process comprising:
(i) receiving an additional query fingerprint representing the media content being received by the client device;
(ii) identifying, by the computing system, an additional match between the additional query fingerprint and an additional reference fingerprint of the plurality of reference fingerprints; and
(iii) responsive to identifying the additional match, incrementing the match counter;
repeating, by the computing system, the match verification process until the match counter reaches a threshold value; and
after the match counter reaches the threshold value, using, by the computing system, the identified matches to identify the media content being received by the client device.
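
The match-counter loop above can be sketched as follows. This sketch assumes exact-lookup fingerprints in a dictionary, a caller-supplied distinctiveness score, and a majority vote over the accumulated matches as the final identification step. The claim does not specify any of these, and every name and default value below is hypothetical.

```python
from collections import Counter
from typing import Callable, Dict, Iterable, Optional


def identify_content(query_fingerprints: Iterable[str],
                     reference_index: Dict[str, str],
                     distinctiveness: Callable[[str], float],
                     min_distinctiveness: float = 0.5,
                     required_matches: int = 3) -> Optional[str]:
    """Identify content, verifying low-distinctiveness matches with extra queries."""
    fps = iter(query_fingerprints)
    first = next(fps, None)
    content = reference_index.get(first)
    if content is None:
        return None
    if distinctiveness(first) >= min_distinctiveness:
        return content  # distinctive enough; no verification needed in this sketch

    match_counter = 1            # incremented for the first match
    matches = [content]
    for fp in fps:               # match verification process, repeated
        if match_counter >= required_matches:
            break
        extra = reference_index.get(fp)
        if extra is not None:
            match_counter += 1
            matches.append(extra)

    if match_counter < required_matches:
        return None              # stream ended before the threshold was reached
    # Assumption: use a majority vote over the identified matches.
    return Counter(matches).most_common(1)[0][0]


index = {"a1": "ad-42", "a2": "ad-42", "a3": "ad-42", "x": "show-7"}
print(identify_content(["a1", "zz", "a2", "a3"], index, lambda fp: 0.1))
```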

US Pat. No. 10,762,889

REAL TIME POPULARITY BASED AUDIBLE CONTENT ACQUISITION

Gracenote Digital Venture...

1. A method comprising:
analyzing, by a computing device, generated playlists for a plurality of client devices, wherein the generated playlists contain ordered references to text-to-speech-generated audio files of media content items respectively customized for the client devices;
determining, by the computing device, that a media content item has a threshold level of popularity based on a number of references to a text-to-speech-generated audio file of the media content item that are included in the generated playlists;
responsive to determining that the media content item has the threshold level of popularity, transmitting, by the computing device and to a remote recording station, a request for a human verbal reading of the media content item;
receiving, by the computing device and from the remote recording station, an audio file of the human verbal reading;
updating, by the computing device, the references to the text-to-speech-generated audio file of the media content item to refer to the audio file of the human verbal reading; and
transmitting, by the computing device, a playlist to a client device of the plurality of client devices, wherein the playlist includes a plurality of ordered references to audio files of media content items, including the audio file of the human verbal reading.
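
The popularity test and the reference update can be sketched as follows, assuming each playlist is an ordered list of string references and that popularity is a simple count of references across playlists. The reference strings, the function names, and the threshold are hypothetical. The request to the recording station is not modeled.

```python
from collections import Counter
from typing import List


def items_needing_human_reading(playlists: List[List[str]], threshold: int) -> List[str]:
    """Return TTS file references that appear in at least `threshold` playlist slots."""
    counts = Counter(ref for playlist in playlists for ref in playlist)
    return sorted(ref for ref, n in counts.items() if n >= threshold)


def swap_reference(playlists: List[List[str]], tts_ref: str, human_ref: str) -> List[List[str]]:
    """Point every reference to the TTS file at the human-read file, keeping order."""
    return [[human_ref if ref == tts_ref else ref for ref in playlist]
            for playlist in playlists]


playlists = [["tts/a", "tts/b"], ["tts/a"], ["tts/a", "tts/c"]]
popular = items_needing_human_reading(playlists, threshold=3)
# After the human reading arrives, update the references.
updated = swap_reference(playlists, "tts/a", "human/a")
print(popular, updated)
```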

US Pat. No. 10,754,891

INSERTING INFORMATION INTO PLAYING CONTENT

GRACENOTE, INC., Emeryvi...

1. An apparatus comprising:
a content break module to:
identify, at a playback device, real-time content by comparing a fingerprint of the real-time content to a plurality of reference fingerprints, the plurality of reference fingerprints corresponding to a plurality of different media; and
identify, at the playback device, a break in the real-time content playing via the playback device based on an identification of the real-time content;
an information selection module to select a first information segment to present during the break based on (1) a first priority of the first information segment being higher than a second priority of a second information segment and (2) a determination that the break is a bounded break; and
an information insertion module to insert the first information segment into the real-time content playing via the playback device upon an occurrence of the break.

US Pat. No. 10,412,447

DYNAMIC VIDEO OVERLAYS

Gracenote, Inc., Emeryvi...

1. A system comprising:
a memory having instructions embodied thereon; and
one or more processors configured by the instructions to perform operations comprising:
accessing, from an intermediate device upstream of the one or more processors, a video input stream that includes first content and a semi-transparent overlay overlaid at least partially on the first content in an overlay zone, wherein the first content is from a content provider and the semi-transparent overlay is overlaid on the first content by the intermediate device;
accessing a template that includes an indication of the overlay zone in the video input stream;
analyzing, using the template, the video input stream to determine that the video input stream includes the semi-transparent overlay in the overlay zone;
accessing replacement content for replacing the first content outside of the overlay zone;
while accessing the video input stream, generating a video output stream that includes a modified semi-transparent overlay overlaid on the replacement content in the overlay zone; and
causing the video output stream, including the replacement content outside of the overlay zone and the modified semi-transparent overlay in the overlay zone, to be presented on a display.

US Pat. No. 10,318,579

INSERTING INFORMATION INTO PLAYING CONTENT

Gracenote, Inc., Emeryvi...

1. A method, comprising:
identifying, at a playback device, by executing an instruction with a processor, real-time content by comparing a fingerprint of the real-time content to a plurality of reference fingerprints, the plurality of reference fingerprints corresponding to a plurality of different media;
identifying, at the playback device by executing an instruction with the processor, a break in the real-time content playing via the playback device based on the identification of the real-time content;
assigning, by executing an instruction with the processor, a ranking to multiple information segments, the ranking based on a priority associated with a timeliness of an information segment relative to other information segments;
selecting, by executing an instruction with the processor, an information segment to present during the break based on the assigned rankings; and
inserting, by executing an instruction with the processor, the information segment into the real-time content playing via the playback device upon an occurrence of the break by playing the information segment over the real-time content on the playback device.
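
The ranking and selection steps can be sketched as follows. The sketch assumes timeliness is modeled by an expiry time, so the segment that goes stale soonest ranks first. The patent does not define timeliness this way, and `Segment`, `expires_at`, and both function names are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Segment:
    name: str
    expires_at: float  # hypothetical timeliness measure, in seconds


def rank_segments(segments: List[Segment], now: float) -> List[Segment]:
    """Rank unexpired segments so the most time-sensitive one comes first."""
    live = [s for s in segments if s.expires_at > now]
    return sorted(live, key=lambda s: s.expires_at)


def select_for_break(segments: List[Segment], now: float) -> Optional[Segment]:
    """Pick the top-ranked segment to play over the content during a break."""
    ranked = rank_segments(segments, now)
    return ranked[0] if ranked else None


segments = [Segment("weather", 500.0), Segment("traffic", 120.0), Segment("stale", 50.0)]
choice = select_for_break(segments, now=100.0)
print(choice.name if choice else None)
```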

US Pat. No. 10,225,605

MEDIA CHANNEL IDENTIFICATION AND ACTION WITH MULTI-MATCH DETECTION BASED ON REFERENCE STREAM COMPARISON

Gracenote, Inc., Emeryvi...

1. A method comprising:
detecting by a computing system, based on a comparison of reference fingerprints with each other, that a plurality of the reference fingerprints match each other, the plurality of reference fingerprints defining a multi-match group, wherein each reference fingerprint represents a respective reference media stream carried by a respective known channel of media content;
after detecting by the computing system that the plurality of reference fingerprints match each other, detecting by the computing system that a query fingerprint representing a media stream being rendered by a media presentation device matches a given one of the reference fingerprints of the plurality;
based on the computing system having detected that the query fingerprint matches the given reference fingerprint of the plurality, and based on the computing system having detected that the plurality of reference fingerprints match each other, determining by the computing system that the query fingerprint matches each reference fingerprint of the plurality;
responsive to the determining that the query fingerprint matches each reference fingerprint of the plurality, using by the computing system one or more attributes of the query fingerprint as a basis to identify a reference fingerprint of the plurality that best matches the query fingerprint;
determining by the computing system, based on the identified reference fingerprint, what channel carries the media stream being rendered by the media presentation device, wherein determining what channel carries the media stream being rendered by the media presentation device comprises determining, as the channel that carries the media stream being rendered by the media presentation device, the channel on which the reference media stream represented by the identified reference fingerprint is carried; and
taking action by the computing system based on the determined channel.

US Pat. No. 11,061,960

GENERATING AND DISTRIBUTING PLAYLISTS WITH RELATED MUSIC AND STORIES

Gracenote, Inc., Emeryvi...

1. A method comprising:
based on a profile associated with a client device, selecting, by a server device, an audio file containing a story, wherein selecting the audio file containing the story involves selecting a text file containing the story and converting, by an automatic text-to-speech system, the text file containing the story to the audio file containing the story, and wherein the client device and the server device are in communication with one another by way of a wide-area network, and wherein the client device includes an audio player application;
identifying, by the server device, a keyword from the story;
based on the keyword, selecting, by the server device, an audio file containing music, wherein the keyword appears in text associated with the audio file containing the music;
automatically generating, by the server device, a playlist for the client device, wherein the playlist sequentially includes (i) a reference to the audio file containing the music, and (ii) a reference to the audio file containing the story; and
transmitting, by the server device over the wide-area network, the playlist to the client device for playout by the audio player application.

US Pat. No. 10,997,985

AUDIO PLAYOUT REPORT FOR RIDE-SHARING SESSION

GRACENOTE, INC., Emeryvi...

1. A computing device comprising:
a microphone;
one or more processors; and
a non-transitory, computer-readable medium storing instructions that, when executed by the one or more processors, cause the computing device to perform a set of acts comprising:
determining that a ride-sharing session is active;
in response to determining the ride-sharing session is active, using the microphone of the computing device to capture audio content;
identifying reference audio content that has at least a threshold extent of similarity with the captured audio content;
determining that the ride-sharing session is inactive; and
outputting an indication of the identified reference audio content.

US Pat. No. 10,986,399

MEDIA CONTENT IDENTIFICATION ON MOBILE DEVICES

Gracenote, Inc., Emeryvi...

1. A computer-implemented method comprising:
obtaining, by a computing device, a fragment of media content, wherein the fragment of media content is a representation of a sequence of media content that is presented by a playback device;
generating, by the computing device, a query fingerprint using the fragment of media content;
determining, by the computing device, that the query fingerprint does not match any reference fingerprints of a set of reference fingerprints, wherein the set of reference fingerprints is stored in a memory of the computing device, and wherein the set of reference fingerprints corresponds to the sequence of media content;
based on the determining that the query fingerprint does not match any reference fingerprints of the set of reference fingerprints, incrementing, by the computing device, a counter;
determining, by the computing device, that a number stored by the counter satisfies a threshold condition; and
based on the determining that the number satisfies the threshold condition, transmitting, by the computing device to a server device, the query fingerprint for comparison with reference fingerprints of a reference database.
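
The local-miss counter that triggers a server lookup can be sketched as follows. It assumes integer fingerprints compared by Hamming distance and a "threshold condition" of a fixed number of misses. Resetting the counter on a local match is an added assumption that the claim does not recite. `LocalMatcher` and its parameters are hypothetical names.

```python
from typing import Iterable


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two integer fingerprints."""
    return bin(a ^ b).count("1")


class LocalMatcher:
    """Match query fingerprints against a local reference set before escalating."""

    def __init__(self, local_refs: Iterable[int], max_misses: int = 3,
                 max_bit_errors: int = 1) -> None:
        self.local_refs = list(local_refs)
        self.max_misses = max_misses
        self.max_bit_errors = max_bit_errors
        self.misses = 0

    def observe(self, query_fp: int) -> bool:
        """Return True when the query fingerprint should be sent to the server."""
        if any(hamming(query_fp, r) <= self.max_bit_errors for r in self.local_refs):
            self.misses = 0  # assumption: a local match resets the counter
            return False
        self.misses += 1
        return self.misses >= self.max_misses


matcher = LocalMatcher([0b1111], max_misses=2)
print([matcher.observe(fp) for fp in (0b1111, 0b0000, 0b0000)])
```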

US Pat. No. 10,970,327

SELECTING BALANCED CLUSTERS OF DESCRIPTIVE VECTORS

GRACENOTE, INC., Emeryvi...

1. A method comprising:
accessing, by one or more processors, descriptive vectors that describe items, each descriptive vector comprising one or more values indicative of an extent to which one or more characteristics are present in a respective item of the items;
determining, by the one or more processors, one or more vector distances between one or more pairs of the descriptive vectors;
generating, by the one or more processors, a hierarchy of vector clusters by clustering the descriptive vectors into the vector clusters based on the determined one or more vector distances;
determining, by the one or more processors, centroid vectors of the vector clusters in the hierarchy, each centroid vector corresponding to a respective vector cluster in the hierarchy;
summing, by the one or more processors, one or more inter-cluster vector distances between one or more pairs of the centroid vectors;
summing, by the one or more processors, for each of the vector clusters, one or more intra-cluster vector distances between one or more pairs of descriptive vectors;
determining, by the one or more processors, a plurality of scores of the hierarchy by applying a plurality of weightings to the summed inter-cluster vector distances and the summed intra-cluster vector distances, wherein each score of the plurality of scores corresponds to a respective weighting of the plurality of weightings, and wherein a particular weighting of the plurality of weightings corresponds to an extreme score of the plurality of scores; and
selecting, by the one or more processors, a subset of the vector clusters in the hierarchy based on the weighting that corresponds to the extreme score.

US Pat. No. 10,970,328

METHOD TO DIFFERENTIATE AND CLASSIFY FINGERPRINTS USING FINGERPRINT NEIGHBORHOOD ANALYSIS

Gracenote, Inc., Emeryvi...

1. A non-transitory machine-readable medium having instructions embodied thereon, which, when executed by one or more processors of a machine, cause the machine to perform operations comprising:
establishing a stop-fingerprint rule for a multimedia fingerprint neighborhood analysis; and
classifying a plurality of multimedia fingerprints into unique fingerprints and non-unique fingerprints by applying the stop-fingerprint rule to each reference fingerprint selected from a plurality of reference fingerprints stored in a reference multimedia fingerprint database,
wherein applying the stop-fingerprint rule on each reference fingerprint in the reference multimedia fingerprint database comprises, for each reference fingerprint performing operations comprising:
determining, for each stop fingerprint in a stop-fingerprint database, a respective fingerprint distance between the reference fingerprint and the stop fingerprint,
determining, for the reference fingerprint, a set of neighboring fingerprints within the stop-fingerprint database, wherein the set of neighboring fingerprints comprises the stop fingerprints within the stop-fingerprint database for which the respective fingerprint distance is less than a first threshold value;
determining a quantity of stop fingerprints in the set of neighboring fingerprints;
performing a comparison of the quantity of stop fingerprints in the set of neighboring fingerprints to a second threshold value;
if the quantity of stop fingerprints in the set of neighboring fingerprints is less than the second threshold value, then classifying the multimedia fingerprint as one of the unique fingerprints; and
if the quantity of stop fingerprints in the set of neighboring fingerprints is greater than the second threshold value, then classifying the multimedia fingerprint as one of the non-unique fingerprints,
wherein the reference fingerprints are split across a plurality of processors, each processor having a part of the reference multimedia fingerprint database.
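
The stop-fingerprint rule can be sketched as follows, assuming integer fingerprints and Hamming distance as the fingerprint distance. The claim does not say how to classify a neighbor count exactly equal to the second threshold, so this sketch treats that case as non-unique. That choice, along with all names and threshold values, is an assumption. Splitting the reference database across processors is not shown.

```python
from typing import List, Sequence, Tuple


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two integer fingerprints."""
    return bin(a ^ b).count("1")


def classify_fingerprints(reference_fps: Sequence[int], stop_fps: Sequence[int],
                          distance_threshold: int,
                          count_threshold: int) -> Tuple[List[int], List[int]]:
    """Split reference fingerprints into (unique, non_unique) lists."""
    unique, non_unique = [], []
    for ref in reference_fps:
        # Neighborhood: stop fingerprints closer than the first threshold.
        neighbors = [s for s in stop_fps if hamming(ref, s) < distance_threshold]
        if len(neighbors) < count_threshold:
            unique.append(ref)
        else:
            # Covers "greater than"; the equal case is an assumption here.
            non_unique.append(ref)
    return unique, non_unique


stops = [0b0000, 0b0001, 0b0011]
unique, non_unique = classify_fingerprints([0b0000, 0b1111], stops,
                                           distance_threshold=2, count_threshold=1)
print(unique, non_unique)
```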

US Pat. No. 10,965,991

DISPLAYING AN ACTIONABLE ELEMENT OVER PLAYING CONTENT

Gracenote, Inc., Emeryvi...

1. A method, comprising:
identifying video content currently playing within a display environment provided by a playback device;
determining a location within the display environment at which to display an actionable element, wherein determining the location comprises:
identifying, based on a fingerprint associated with the identified video content, an area within the display environment in which the playback device displays a navigation element native to the playback device, and
based on the identifying, selecting the identified area within the display environment as the location within the display environment at which to display the actionable element; and
in response to determining the location within the display environment at which to display the actionable element, displaying the actionable element at the determined location within the display environment, wherein the actionable element has a configuration that is based on a size of the identified area within the display environment, wherein the actionable element is based on the identified video content, and wherein the actionable element includes one or more user-selectable options to perform an action associated with the identified video content.

US Pat. No. 10,949,482

USER PROFILE BASED ON CLUSTERING TIERED DESCRIPTORS

GRACENOTE, INC., Emeryvi...

1. An apparatus comprising:
memory; and
at least one processor to execute instructions to at least:
group descriptors into a cluster of descriptors based on an association between the descriptors and each of a first item and a second item;
access biometric data of a user via a user device;
determine a first activity in which the user is engaged based on contextual data that correlates the cluster of descriptors with the biometric data of the user received from the user device via the network;
determine whether the first activity represents an anomalous phase of the user based on a time period of the first activity having a duration shorter than a threshold duration;
in response to the determining that the first activity does not represent the anomalous phase of the user, generate a user profile based on the first activity of the user and the cluster of descriptors;
in response to the determining that the first activity represents the anomalous phase of the user, omit a name of the cluster descriptors from the user profile to reduce size of the profile; and
in response to a second activity of the user matching the first activity associated with the cluster of descriptors within the user profile, generate a recommendation including a third item based on the user profile.

US Pat. No. 10,944,371

METHODS AND APPARATUS FOR AUDIO EQUALIZATION

GRACENOTE, INC., Emeryvi...

1. An apparatus comprising:
a volume adjuster to apply a plurality of equalization adjustments to an audio signal to generate an equalized audio signal, the plurality of equalization adjustments output from a neural network in response to an input feature set including an average volume representation of the audio signal;
a thresholding controller to:
detect an irregularity in a frequency representation of the audio signal after application of the plurality of equalization adjustments, the irregularity corresponding to a change in volume between adjacent frequency values exceeding a threshold; and
adjust a volume at a first frequency value of the adjacent frequency values to reduce the irregularity;
an equalization (EQ) curve generator to generate an EQ curve to apply to the audio signal when the irregularity has been reduced; and
a frequency to time domain converter to output the equalized audio signal in a time domain based on the EQ curve.
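
The thresholding controller's behavior can be sketched as follows, assuming the frequency representation is a list of per-bin levels in dB and that an irregularity is reduced by moving the first bin of the offending pair until the jump equals the threshold. The patent may adjust the level differently. The neural network, EQ curve generation, and time-domain conversion are not modeled, and the function name and parameter are hypothetical.

```python
from typing import List, Sequence


def smooth_irregularities(levels_db: Sequence[float], max_jump_db: float) -> List[float]:
    """Single forward pass that limits the level jump between adjacent bins.

    Only the first bin of each offending pair is adjusted, so a corrected bin
    can still differ from its predecessor by more than the threshold.
    """
    out = list(levels_db)
    for i in range(len(out) - 1):
        jump = out[i + 1] - out[i]
        if abs(jump) > max_jump_db:
            # Pull the first bin toward its neighbor so the jump equals the limit.
            out[i] = out[i + 1] - max_jump_db if jump > 0 else out[i + 1] + max_jump_db
    return out


print(smooth_irregularities([0.0, 0.0, 12.0, 12.0], max_jump_db=6.0))
```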

US Pat. No. 10,887,031

VEHICLE-BASED MEDIA SYSTEM WITH AUDIO AD AND NAVIGATION-RELATED ACTION SYNCHRONIZATION FEATURE

GRACENOTE, INC., Emeryvi...

1. A vehicle-based media system comprising:
a user interface;
a microphone;
one or more processors; and
a non-transitory, computer-readable medium storing instructions that, when executed by the one or more processors, cause the vehicle-based media system to perform a set of operations comprising:
receiving audio content;
causing one or more speakers to output the received audio content;
using the microphone of the vehicle-based media system to capture the output audio content;
identifying reference audio content that has at least a threshold extent of similarity with the captured audio content;
identifying a geographic location associated with the identified reference audio content; and
based at least on the identified geographic location associated with the identified reference audio content, outputting, via the user interface of the vehicle-based media system, a prompt to navigate to the identified geographic location.

US Pat. No. 10,866,988

IDENTIFYING MEDIA CONTENT VIA FINGERPRINT MATCHING

Gracenote, Inc., Emeryvi...

1. A method comprising:
generating, by one or more processors of a client device, a query fingerprint based on a subtraction of a spatial subregion within a spatial region of a query media frame from at least one other spatial subregion of the spatial region of the query media frame, wherein the query media frame is a video frame within video content received by the client device, wherein the client device comprises a display configured to display the video content, wherein a pillar bar is identified in the spatial region and the identified pillar bar is excluded from the spatial region prior to generating the query fingerprint;
comparing, by one or more processors of the client device, the query fingerprint to a reference fingerprint generated based on a subtraction of a spatial subregion within a spatial region of a reference media frame from at least one other spatial subregion of the spatial region of the reference media frame;
determining, by one or more processors of the client device, that the query fingerprint matches the reference fingerprint; and
identifying, by one or more processors of the client device, the video content based on the determining that the query fingerprint matches the reference fingerprint.
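
A fingerprint built from subregion subtraction, with pillar bars excluded first, can be sketched as follows. The sketch assumes a grayscale frame stored as a list of rows, pillar bars that show up as near-black outer columns, and one fingerprint bit per consecutive pair of subregion means. The grid size, the `black_level` cutoff, and the function names are hypothetical, not taken from the patent.

```python
from typing import List

Frame = List[List[int]]


def strip_pillar_bars(frame: Frame, black_level: int = 16) -> Frame:
    """Drop leading and trailing columns whose pixels are all near black."""
    keep = [c for c in range(len(frame[0]))
            if any(row[c] > black_level for row in frame)]
    if not keep:
        return frame
    return [row[keep[0]:keep[-1] + 1] for row in frame]


def region_mean(frame: Frame, r0: int, r1: int, c0: int, c1: int) -> float:
    """Mean pixel value of the subregion rows r0..r1-1, columns c0..c1-1."""
    vals = [frame[r][c] for r in range(r0, r1) for c in range(c0, c1)]
    return sum(vals) / len(vals)


def fingerprint(frame: Frame, grid: int = 2) -> int:
    """One bit per consecutive pair of subregions: sign of the mean difference."""
    frame = strip_pillar_bars(frame)
    h, w = len(frame), len(frame[0])
    means = [region_mean(frame, i * h // grid, (i + 1) * h // grid,
                         j * w // grid, (j + 1) * w // grid)
             for i in range(grid) for j in range(grid)]
    bits = 0
    for a, b in zip(means, means[1:]):
        bits = (bits << 1) | (1 if a - b > 0 else 0)
    return bits


content = [[200, 200, 50, 50], [200, 200, 50, 50], [50, 50, 200, 200], [50, 50, 200, 200]]
pillarboxed = [[0, 0] + row + [0, 0] for row in content]
print(fingerprint(content) == fingerprint(pillarboxed))
```

Because the bars are removed before the subregions are laid out, the same content yields the same fingerprint with or without pillarboxing.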

US Pat. No. 10,798,484

METHODS AND APPARATUS FOR AUDIO EQUALIZATION BASED ON VARIANT SELECTION

GRACENOTE, INC., Emeryvi...

1. An apparatus to generate equalization adjustments for an audio signal based on a user input, the apparatus comprising:
a user interface to prompt a user for the user input corresponding to a selected variant of music; and
an input feature set generator to generate an equalization input feature set, the equalization input feature set to be used by a model executor to adjust at least one weight of a neural network model to generate example equalization adjustments for the audio signal based on the user input, the equalization input feature set generated by:
generating a data structure corresponding to the user input, the data structure including a number of entries identifying the selected variant of music; and
including the data structure in the equalization input feature set.

US Pat. No. 10,484,758

COMPUTING SYSTEM WITH CONTENT-CHARACTERISTIC-BASED TRIGGER FEATURE

Gracenote, Inc., Emeryvi...

1. A method comprising:
receiving, by a computing system, media content, wherein the media content comprises a first portion and a second portion, wherein the computing system receives the first portion before receiving the second portion;
generating, by the computing system, a fingerprint of the second portion of the received media content;
determining, by the computing system, that the first portion of the received media content includes a threshold extent of non-substantive media content;
responsive to determining that the first portion of the received media content includes the threshold extent of non-substantive media content, transmitting, by the computing system, the generated fingerprint to a content identification server to identify the second portion of the received media content; and
performing an action that relates to the identified second portion of media content.

US Pat. No. 10,419,814

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON TIME OF BROADCAST

Gracenote, Inc., Emeryvi...

1. A non-transitory computer readable medium having stored thereon instructions executable by a processing unit to carry out operations comprising:
determining that a fingerprint of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
responsive to determining that the fingerprint matches the multiple reference fingerprints each corresponding with a different respective channel, performing disambiguation based at least in part on a time of broadcast of the media content being rendered by the media presentation device, to determine which channel the media presentation device is receiving the media content on; and
taking action based on the determined channel.

US Pat. No. 11,049,507

METHODS, APPARATUS, AND ARTICLES OF MANUFACTURE TO IDENTIFY SOURCES OF NETWORK STREAMING SERVICES

Gracenote, Inc., Emeryvi...


1. An apparatus, comprising:
a signal bandwidth identifier logic circuit to identify a signal bandwidth of a received first audio signal that represents a decompressed second audio signal, the signal bandwidth identifier including:
a transformer logic circuit to form a plurality of frequency spectrums for respective ones of a plurality of time intervals of the received first audio signal;
a thresholder logic circuit to identify a plurality of indices representative of cutoff frequencies of respective ones of the plurality of time intervals; and
a smoother logic circuit to determine a median of the plurality of indices, the median representative of an overall cutoff frequency of the received first audio signal; and

a source identifier logic circuit to identify a source of the second audio signal based on the identified signal bandwidth.
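
The thresholder and smoother elements of the claim above can be illustrated with a minimal Python sketch. It assumes the per-interval magnitude spectra are already available as lists of numbers. The function names, the threshold value, and the lookup table that maps a cutoff bin to a source label are hypothetical and do not come from the patent.

```python
from statistics import median


def cutoff_index(spectrum, threshold):
    """Return the highest bin index whose magnitude exceeds the threshold (0 if none does)."""
    for i in range(len(spectrum) - 1, -1, -1):
        if spectrum[i] > threshold:
            return i
    return 0


def overall_cutoff(spectra, threshold):
    """Median of the per-interval cutoff indices, used as the overall cutoff."""
    return median([cutoff_index(s, threshold) for s in spectra])


def identify_source(cutoff, nominal_cutoffs):
    """Pick the label whose nominal cutoff bin is closest (hypothetical lookup table)."""
    return min(nominal_cutoffs, key=lambda name: abs(nominal_cutoffs[name] - cutoff))


# Three time intervals with five frequency bins each (made-up magnitudes).
spectra = [
    [1.0, 1.0, 1.0, 0.0, 0.0],
    [1.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 1.0, 1.0, 1.0, 0.0],
]
cutoff = overall_cutoff(spectra, threshold=0.5)
source = identify_source(cutoff, {"service_a": 2, "service_b": 4})
```

Taking the median rather than the mean keeps a single unusually quiet or loud interval from shifting the overall estimate.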

US Pat. No. 11,017,021

GENERATING AND DISTRIBUTING PLAYLISTS WITH MUSIC AND STORIES HAVING RELATED MOODS

Gracenote, Inc., Emeryvi...

1. A method comprising:
determining, by a server device, a mood of music within an audio file containing the music;
looking up, by the server device, the mood of the music in a table, wherein the table identifies, for the mood of the music, other moods from a plurality of moods including those that are opposed to the mood of the music;
selecting, by the server device, an audio file containing a story, wherein the audio file containing the story is associated with a story attribute indicative of a second mood of the plurality of moods, wherein the second mood is not opposed to the mood of the music;
generating, by the server device, a playlist, wherein the playlist includes (i) a first reference to the audio file containing the music, and (ii) a second reference to the audio file containing the story; and
transmitting, by the server device, the playlist to a client device.
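
The table lookup and story selection recited above might be sketched as follows. This is an illustration only: the mood names, the contents of the opposed-mood table, the file names, and the function names are all assumptions, and the mood of the music is taken as already determined.

```python
# Hypothetical table: for each mood, the set of moods treated as opposed to it.
OPPOSED_MOODS = {
    "happy": {"sad", "angry"},
    "calm": {"angry"},
    "sad": {"happy"},
}


def select_story(music_mood, stories, opposed=OPPOSED_MOODS):
    """Return the first story file whose mood attribute is not opposed to the music mood."""
    blocked = opposed.get(music_mood, set())
    for story_file, story_mood in stories:
        if story_mood not in blocked:
            return story_file
    return None


def build_playlist(music_file, story_file):
    """A playlist here is simply an ordered list of references to the two audio files."""
    return [music_file, story_file]


stories = [("story_a.mp3", "sad"), ("story_b.mp3", "calm")]
chosen = select_story("happy", stories)
playlist = build_playlist("song.mp3", chosen)
```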

US Pat. No. 11,010,627

METHODS AND SYSTEMS FOR SCOREBOARD TEXT REGION DETECTION

Gracenote, Inc., Emeryvi...

1. A method for automatically detecting, within a digital video frame, a video frame region that depicts a textual expression of a scoreboard, the method comprising:
receiving, by a computing system, the digital video frame, wherein the digital video frame depicts scoreboard image elements, wherein the depicted scoreboard image elements include the textual expression and other image elements, and wherein the computing system comprises a processing unit;
engaging, by the computing system, in an edge-detection process to detect edges of at least the depicted scoreboard image elements in the received digital video frame, wherein at least some of the detected edges (i) are of the textual expression and (ii) define alphanumeric shapes;
applying pattern-recognition, by the computing system, to identify the alphanumeric shapes, wherein applying the pattern-recognition to identify a given one of the alphanumeric shapes comprises (i) identifying one or more edges, of the detected edges, based on the one or more identified edges enclosing an area within the digital video frame and (ii) determining that the enclosed area has an alphanumeric characteristic and responsively deeming the one or more identified edges as defining the given alphanumeric shape;
establishing, by the computing system, a plurality of minimum bounding rectangles each bounding a respective one of the identified alphanumeric shapes in the digital video frame;
establishing in the digital video frame, by the computing system, based on at least two of the minimum bounding rectangles, a composite shape that encompasses the identified alphanumeric shapes that were bounded by the at least two minimum bounding rectangles, wherein the at least two minimum bounding rectangles comprise first and second minimum bounding rectangles, wherein establishing the composite shape comprises extending at least the first minimum bounding rectangle by a particular extent, and wherein the extending merges together at least the extended first minimum bounding rectangle and the second minimum bounding rectangle into the composite shape; and
based on the established composite shape occupying a particular region of the digital video frame, deeming, by the computing system, the particular region to be the video frame region that depicts the textual expression of the scoreboard.
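
The step of extending one minimum bounding rectangle so that it merges with a neighbor can be sketched as below. The sketch assumes axis-aligned rectangles given as `(x0, y0, x1, y1)` tuples and a purely horizontal extension. Both are illustrative choices, and the function names are hypothetical.

```python
def extend(rect, extent):
    """Grow a rectangle horizontally by `extent` pixels on each side."""
    x0, y0, x1, y1 = rect
    return (x0 - extent, y0, x1 + extent, y1)


def overlaps(a, b):
    """True if two axis-aligned rectangles touch or intersect."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def union(a, b):
    """Smallest rectangle that contains both inputs."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def composite_shape(first, second, extent):
    """Extend the first rectangle; if it now reaches the second, merge the two."""
    grown = extend(first, extent)
    if overlaps(grown, second):
        return union(grown, second)
    return None


# Two character boxes separated by a 2-pixel gap merge once the first is extended by 3.
merged = composite_shape((0, 0, 10, 10), (12, 0, 20, 10), extent=3)
```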

US Pat. No. 10,997,424

METHODS AND SYSTEMS FOR SPORT DATA EXTRACTION

Gracenote, Inc., Emeryvi...

1. A method comprising:
receiving, by a computing system, video frames that depict a sport event, wherein the depicted sport event includes a depiction of a sport score associated with the sport event, and wherein the computing system comprises a processing unit;
engaging, by the computing system, in digital image processing of the received video frames to automatically generate sport data that indicates the sport score, wherein the digital image processing comprises:
(i) identifying a first frame region of the video frames based on the first frame region depicting a scoreboard,
(ii) executing a first machine-driven pattern recognition procedure that includes analyzing the identified first frame region to detect, within the identified first frame region, one or more second frame regions based on the second frame regions depicting text of the scoreboard,
(iii) in response to detecting the second frame regions, executing a second machine-driven pattern recognition procedure to recognize the text in at least one of the one or more second frame regions, and
(iv) based at least on the recognizing of the text, generating the sport data that indicates the sport score; and
in response to completing the digital image processing, carrying out, by the computing system, an action based on the generated sport data.

US Pat. No. 10,998,872

METHODS AND APPARATUS FOR AUDIO EQUALIZATION

GRACENOTE, INC., Emeryvi...

1. An apparatus comprising:
a volume adjuster to apply a plurality of equalization adjustments to an audio signal to generate an equalized audio signal, the plurality of equalization adjustments output from a neural network in response to an input feature set including an average volume representation of the audio signal;
a thresholding controller to:
detect an irregularity in a frequency representation of the audio signal after application of the plurality of equalization adjustments, the irregularity corresponding to a change in volume between adjacent frequency values exceeding a threshold; and
adjust a volume at a first frequency value of the adjacent frequency values to reduce the irregularity;
an equalization (EQ) curve generator to generate an EQ curve to apply to the audio signal when the irregularity has been reduced; and
a frequency to time domain converter to output the equalized audio signal in a time domain based on the EQ curve.

US Pat. No. 10,972,203

DETECTING AND RESPONDING TO RENDERING OF INTERACTIVE VIDEO CONTENT

Gracenote, Inc., Emeryvi...

1. A method of detecting and responding to rendering of video content by a video presentation device, wherein the video content includes (i) a pre-established video segment that does not vary based on user-interaction during the rendering and (ii) a dynamically-defined video segment that varies based on user-interaction during the rendering, the method comprising:
obtaining by a computing system a query fingerprint generated in real-time during the rendering as a representation of the video content being rendered, the query fingerprint including a first portion representing the pre-established video segment and a second portion representing the dynamically-defined video segment;
while obtaining the query fingerprint, the computing system (a) detecting a match between the first portion of the query fingerprint and a reference fingerprint that represents the pre-established video segment, (b) based on the detecting of the match, identifying the video content being rendered, (c) after identifying the video content being rendered, applying a trained neural network to at least the second portion of the query fingerprint, and (d) detecting, based on the applying of the neural network, that rendering of the identified video content continues; and
responsive to at least the detecting that rendering of the identified video content continues, the computing system taking action specific to the identified video content.

US Pat. No. 10,956,484

METHOD TO DIFFERENTIATE AND CLASSIFY FINGERPRINTS USING FINGERPRINT NEIGHBORHOOD ANALYSIS

Gracenote, Inc., Emeryvi...

1. A method for classification of multimedia fingerprints, the method comprising:
establishing a stop-fingerprint rule for multimedia fingerprint neighborhood analysis; and
classifying multimedia fingerprints into unique fingerprints and non-unique fingerprints by applying the stop-fingerprint rule on each reference fingerprint in a reference multimedia fingerprint database,
wherein applying the stop-fingerprint rule on each reference fingerprint in the reference multimedia fingerprint database comprises, for each reference fingerprint performing operations comprising:
determining, for each stop fingerprint in a stop-fingerprint database, a respective fingerprint distance between the reference fingerprint and the stop fingerprint,
determining, for the reference fingerprint, a set of neighboring fingerprints within the stop-fingerprint database, wherein the set of neighboring fingerprints comprises the stop fingerprints within the stop-fingerprint database for which the respective fingerprint distance is less than a first threshold value;
determining a quantity of stop fingerprints in the set of neighboring fingerprints;
performing a comparison of the quantity of stop fingerprints in the set of neighboring fingerprints to a second threshold value;
if the quantity of stop fingerprints in the set of neighboring fingerprints is less than the second threshold value, then classifying the multimedia fingerprint as one of the unique fingerprints; and
if the quantity of stop fingerprints in the set of neighboring fingerprints is greater than the second threshold value, then classifying the multimedia fingerprint as one of the non-unique fingerprints,
wherein the reference fingerprints are split across a plurality of processors, each processor having a part of the reference multimedia fingerprint database.
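
The neighborhood test in the claim above can be illustrated with a short sketch. It assumes fingerprints are plain integers compared by Hamming distance. The distance measure, the function names, and the handling of a count exactly equal to the second threshold are assumptions, since the claim only addresses counts below and above it. Splitting the reference database across processors is not shown.

```python
def hamming(a, b):
    """Number of differing bits between two integer fingerprints (assumed distance measure)."""
    return bin(a ^ b).count("1")


def classify(reference_fps, stop_fps, distance_threshold, count_threshold):
    """Split reference fingerprints into unique and non-unique by counting nearby stop fingerprints."""
    unique, non_unique, undecided = [], [], []
    for ref in reference_fps:
        # Neighbors: stop fingerprints closer than the first threshold.
        neighbors = [s for s in stop_fps if hamming(ref, s) < distance_threshold]
        if len(neighbors) < count_threshold:
            unique.append(ref)
        elif len(neighbors) > count_threshold:
            non_unique.append(ref)
        else:
            # Equality is not covered by the claim; it is kept separate in this sketch.
            undecided.append(ref)
    return unique, non_unique, undecided


stop_fps = [0b0000, 0b0001, 0b0011]
reference_fps = [0b0000, 0b11110000]
unique, non_unique, undecided = classify(
    reference_fps, stop_fps, distance_threshold=2, count_threshold=1
)
```

In this example the all-zero fingerprint has two stop fingerprints within distance 2, so it is classed as non-unique, while `0b11110000` has none and is classed as unique.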

US Pat. No. 10,945,049

METHOD AND APPARATUS FOR SELECTION OF CONTENT FROM A STREAM OF DATA

Gracenote, Inc., Emeryvi...

1. An apparatus comprising:
a comparator unit arranged to:
receive a main stream and a reference stream, the main stream having a higher data rate than the reference stream and comprising successive content elements and inserted sections intermixed with the successive content elements, the reference stream defining sequential reference fingerprints that correspond to at least some of the successive content elements;
compute a main fingerprint from the successive content elements; and
compare the main fingerprint with a reference fingerprint among the sequential reference fingerprints defined by the reference stream that has a lower data rate than the main stream; and
a control unit arranged to select content elements from the main stream based on whether the main fingerprint matches the reference fingerprint among the sequential reference fingerprints defined by the reference stream that has the lower data rate than the main stream.

US Pat. No. 10,914,593

DYNAMIC CONTENT DELIVERY BASED ON VEHICLE NAVIGATIONAL ATTRIBUTES

GRACENOTE, INC., Emeryvi...

1. A system comprising:
at least one processor; and
memory including instructions that, when executed, cause the at least one processor to:
determine a relevancy score for respective ones of a plurality of sporting event data items based on navigational attributes of a vehicle, the navigational attributes including at least one of a location of the vehicle, vehicle sensor data, or vehicle navigational data;
determine whether the navigational attributes correspond to a driving condition, the driving condition indicative of a driving pattern of the vehicle in response to a condition external to the vehicle; and
in response to determining that the navigational attributes correspond to the driving condition, present a first sporting event data item of the plurality of sporting event data items in response to a relevancy score of the first sporting event data item corresponding to the driving condition.

US Pat. No. 10,910,015

SYNTHESIZING A PRESENTATION FROM MULTIPLE MEDIA CLIPS

Gracenote, Inc., Emeryvi...

1. A method comprising:
accessing, by one or more processors, a plurality of media clips and metadata for the plurality of media clips;
using the metadata for the plurality of media clips to select, by the one or more processors, from the plurality of media clips, a subset of media clips including a first media clip and a second media clip;
matching, by the one or more processors, a first fingerprint of at least a part of the first media clip with a second fingerprint of at least a part of the second media clip;
determining, by the one or more processors, a temporal overlap of the first media clip with the second media clip based at least in part on the match of the first fingerprint of at least the part of the first media clip with the second fingerprint of at least the part of the second media clip;
based on the temporal overlap of the first media clip with the second media clip:
merging, by the one or more processors, the first media clip and the second media clip into a group of temporally-overlapping media clips, and
synthesizing, by the one or more processors, the group of temporally-overlapping media clips into a media presentation.

US Pat. No. 10,885,109

MULTIPLE STAGE INDEXING OF AUDIO CONTENT

Gracenote, Inc., Emeryvi...

1. A computer-implemented method comprising:
determining, by at least one hardware processor, a plurality of clusters based on a comparison of a plurality of audio contents using a first matching criteria, each cluster of the plurality of clusters comprising at least two audio contents from the plurality of audio contents;
for each cluster of the plurality of clusters, determining, by the at least one hardware processor, a representative audio content for the cluster from the at least two audio contents of the cluster;
loading, by the at least one hardware processor, the corresponding representative audio content of each cluster in the plurality of clusters into an index, the index including the representative audio content stored in association with a hash value, the hash value based on permutations of a binary vector formed using a spectral representation of the audio content;
generating a first comparison, by the at least one hardware processor, of a query audio content to each one of the representative audio contents in the index using the first matching criteria, the first comparison including a comparison of at least one of a content duration ratio, a bit error rate at a matching location, or a length of matching positions;
matching, by the at least one hardware processor, the query audio content to one of the representative audio contents based on the generated first comparison;
determining, by the at least one hardware processor, the corresponding cluster of the matched one of the representative audio contents;
generating a second comparison, by the at least one hardware processor, of the query audio content to each one of the at least two audio contents of the determined corresponding cluster of the matched one of the representative audio contents using a second matching criteria different from the first matching criteria; and
identifying, by the at least one hardware processor, a match between the query audio content and at least one of the audio contents of the determined corresponding cluster of the matched one of the representative audio contents based on the generated second comparison of the determined corresponding cluster using the first and the second matching criteria, the match used to determine similarity of the queried audio content to the representative audio content;
wherein the determining the plurality of clusters comprises comparing fingerprint data of each of the plurality of audio contents using the first matching criteria, the comparison including a granular comparison based on a sub-cluster of each of the plurality of clusters.

US Pat. No. 10,869,086

DYNAMIC VIDEO OVERLAYS

Gracenote, Inc., Emeryvi...

1. A method for dynamically providing a video overlay, the method comprising:
accessing, by one or more processors, a video input stream that includes first video content from a first content provider;
accessing, by the one or more processors, replacement video content for a video output stream, wherein the replacement video content of the video output stream is different from the first video content of the video input stream;
while accessing the video input stream, causing the replacement video content of the video output stream to be presented on a display;
while the display is presenting the replacement video content of the video output stream, determining, by the one or more processors, that a first change in the video input stream has occurred, wherein the first change indicates a change from the first content provider to a second content provider of the video input stream has occurred, wherein the video input stream includes second video content from the second content provider after the first change has occurred;
in response to determining the change from the first content provider to the second content provider has occurred, causing the second video content of the video input stream to be presented on the display instead of the replacement video content;
after causing the second video content of the video input stream to be presented on the display:
accessing, from a database, a template corresponding to a screen portion of the display for third video content generated by an intermediate device;
analyzing, by the one or more processors, the video input stream to determine that at least a portion of the third video content of a frame of the video input stream matches the template when the video input stream includes the first video content from the first content provider and the third video content generated by the intermediate device;
based on the at least the portion of the third video content of the frame of the video input stream matching the template, determining that a second change in the video input stream indicates a return to the first content provider of the video input stream; and
in response to the determination that the first content provider of the video input stream has been returned to:
modifying the video output stream to include the replacement video content and the third video content, and
causing a presentation of the replacement video content and the third video content of the modified video output stream on the display, wherein the third video content is presented as the video overlay on the replacement video content.

US Pat. No. 10,827,224

MODIFYING PLAYBACK OF REPLACEMENT CONTENT RESPONSIVE TO DETECTION OF REMOTE CONTROL SIGNALS THAT CONTROL A DEVICE PROVIDING VIDEO TO THE PLAYBACK DEVICE

Gracenote, Inc., Emeryvi...

1. A method comprising:
identifying, by a playback device, a media device that provides media content to the playback device;
obtaining, by the playback device, a set of playback-modification actions corresponding to the media device;
providing, by the playback device, replacement media content for display;
while the playback device is displaying the replacement media content, receiving, using a receiver of the playback device, an instruction that a remote control transmitted to the media device;
selecting, by the playback device, a playback-modification action corresponding to the instruction from among the set of playback-modification actions corresponding to the media device; and
modifying, by the playback device, playback of the replacement media content in accordance with the playback-modification action.

US Pat. No. 10,776,419

AUDIO FILE QUALITY AND ACCURACY ASSESSMENT

GRACENOTE DIGITAL VENTURE...

1. A method performed by a computing system, the method comprising:
receiving a plurality of audio files from a plurality of audio file creators, wherein an individual audio file of the plurality of audio files comprises an audible version of a corresponding text-based news story;
storing the plurality of received audio files in a tangible, non-transitory computer readable media;
determining whether the individual audio file is acceptable for distribution to a plurality of subscribers for playback based on a computer analysis of the individual audio file, wherein determining whether the individual audio file is acceptable for distribution to a plurality of subscribers for playback based on a computer analysis of the individual audio file comprises determining whether an average volume of the individual audio file is (i) less than or equal to a maximum average volume threshold and (ii) greater than or equal to a minimum average volume threshold;
in response to determining that the individual audio file is acceptable for distribution, approving the individual audio file for distribution;
in response to determining that the individual audio file is not acceptable for distribution, determining whether the individual audio file that is not acceptable for distribution can be remediated;
in response to determining that the individual audio file that is not acceptable for distribution can be remediated, remediating the individual audio file that is not acceptable for distribution;
in response to determining that the individual audio file that is not acceptable for distribution cannot be remediated, determining whether the individual file that is not acceptable for distribution is not so flawed as to require rejection and should be reviewed by a human;
in response to determining that the individual audio file that is not acceptable for distribution but not so flawed as to require rejection should be reviewed by a human, designating the individual audio file for review by a human;
in response to determining that the individual audio file that is not acceptable for distribution and cannot be remediated should not be reviewed by a human, rejecting the individual audio file that is not acceptable for distribution and cannot be remediated; and
wherein the computing system is configured to perform at least all three actions of rejecting individual audio files, remediating individual audio files, and designating individual audio files for review by a human.
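
The average-volume test and the approve, remediate, review, reject branching in the claim above can be sketched as follows. The sketch assumes audio is a list of normalized samples and measures volume as mean absolute amplitude. The volume measure, the threshold values, and the function names are hypothetical. How remediability and the need for human review are decided is not specified here, so those decisions are passed in as booleans.

```python
def average_volume(samples):
    """Mean absolute amplitude, used here as a stand-in volume measure."""
    return sum(abs(s) for s in samples) / len(samples)


def is_acceptable(samples, min_avg, max_avg):
    """Acceptable when the average volume lies within [min_avg, max_avg]."""
    return min_avg <= average_volume(samples) <= max_avg


def triage(acceptable, can_remediate, needs_human_review):
    """Follow the branching order of the claim and return the chosen action."""
    if acceptable:
        return "approve"
    if can_remediate:
        return "remediate"
    if needs_human_review:
        return "human_review"
    return "reject"


quiet_enough = is_acceptable([0.2, -0.4, 0.6], min_avg=0.1, max_avg=0.5)
too_loud = is_acceptable([0.9, -0.9], min_avg=0.1, max_avg=0.5)
action = triage(too_loud, can_remediate=False, needs_human_review=True)
```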

US Pat. No. 10,735,119

MODIFYING PLAYBACK OF CONTENT USING PRE-PROCESSED PROFILE INFORMATION

Gracenote, Inc., Emeryvi...

1. A method comprising:
accessing, by executing an instruction with a processor, a media stream to be provided to a device via a network, the media stream including a piece of media;
identifying, by executing an instruction with the processor, the piece of media using a comparison of a reference fingerprint of the piece of media to a query fingerprint generated based on the piece of media;
selecting, by executing an instruction with the processor, a profile of equalization parameters from a plurality of profiles, the selected profile corresponding to the identified piece of media and specifying at least one corresponding output frequency parameter to be used when outputting the piece of media at the device, the selecting of the profile being enabled by the identifying of the piece of media; and
providing, in response to the selection of the profile of equalization parameters and by executing an instruction with the processor, the selected profile of equalization parameters and the media stream to the device via the network.

US Pat. No. 10,701,411

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICES

Gracenote, Inc., Emeryvi...

1. A method for use in connection with a client device and a sequence of media content that includes a first portion followed by a second portion, the method comprising:
presenting, by the client device, the first portion of the sequence of media content;
performing, by the client device, a content replacement operation, wherein performing the content replacement operation comprises presenting replacement media content instead of the second portion of the sequence of media content;
while performing the content replacement operation:
(i) monitoring, by the client device, advancement of the second portion of the sequence of media content by comparing fingerprint data representing the second portion of the sequence of media content with reference fingerprint data representing expected media content, and
(ii) based on the monitoring, determining, by the client device, that the second portion of the sequence of media content was interrupted by interrupting media content, wherein determining that the second portion of the sequence of media content was interrupted by interrupting media content comprises determining that the comparing resulted in no match being found; and
responsive to determining that the second portion of the sequence of media content was interrupted by interrupting media content, switching, by the client device, from presenting the replacement media content to presenting the interrupting media content instead.

US Pat. No. 10,701,412

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICES

Gracenote, Inc., Emeryvi...

1. A method for use in connection with a client device and a sequence of media content that includes a first portion followed by a second portion, the method comprising:
presenting, by the client device, the first portion of the sequence of media content;
performing, by the client device, a content replacement operation, wherein performing the content replacement operation comprises presenting replacement media content instead of the second portion of the sequence of media content;
while performing the content replacement operation:
(i) generating, by the client device, fingerprint data representing the second portion of the sequence of media content,
(ii) sending, by the client device, the generated fingerprint data to a server device to facilitate the server device comparing the generated fingerprint data with reference fingerprint data representing expected media content, and
(iii) responsive to sending the generated fingerprint data to the server device, receiving an instruction to switch from presenting the replacement media content to presenting interrupting media content instead, wherein the received instruction is generated by the server device responsive to the server device determining that the sequence of media content was interrupted by interrupting media content, and wherein determining that the sequence of media content was interrupted by interrupting media content comprises (i) comparing the generated fingerprint data with reference fingerprint data representing expected media content, and (ii) determining that the comparing resulted in no match being found between the generated fingerprint data and the reference fingerprint data; and
responsive to receiving the instruction, switching, by the client device, from presenting the replacement media content to presenting the interrupting media content.

US Pat. No. 10,678,828

MODEL-BASED MEDIA CLASSIFICATION SERVICE USING SENSED MEDIA NOISE CHARACTERISTICS

GRACENOTE, INC., Emeryvi...

1. A method for classifying media, the method comprising:
accessing, using one or more processor circuits associated with a first device, first digital media data that represents a first media query to be identified;
determining, using the one or more processor circuits associated with the first device, a first noise characteristic corresponding to the first digital media data;
determining whether a prior classification attempt for a second media query was successful, the second media query corresponding to second digital media data having a similar noise characteristic to the first noise characteristic;
in response to determining the prior classification attempt was successful, transmitting the first digital media data to a remote media classification circuit that is configured to identify a source characteristic of the first digital media data; and
in response to determining the prior classification attempt unsuccessful, inhibiting the transmission of the first digital media data to the remote media classification circuit.

US Pat. No. 11,107,458

AUDIO STREAMING OF TEXT-BASED ARTICLES FROM NEWSFEEDS

Gracenote Digital Venture...


1. A method comprising:
receiving, by a server device, a representation of a plurality of uniform resource locators (URLs) of text-based articles;
retrieving, by the server device, the text-based articles and respective publication dates of each of the text-based articles from one or more newsfeed servers;
causing, by the server device, the text-based articles to be converted into audio files;
receiving, by the server device, a request to stream the audio files to a client device;
generating, by the server device, a playlist with references to the audio files ordered chronologically according to the respective publication dates of their associated text-based articles; and
transmitting, to the client device, the playlist.
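
The playlist-generation step above amounts to sorting audio references by the publication date of the source article. A minimal sketch, assuming each converted article is represented as an `(audio_reference, publication_date)` pair; the file names and the function name are hypothetical.

```python
from datetime import date


def build_playlist(converted_articles):
    """Return audio references ordered by the publication date of their articles."""
    ordered = sorted(converted_articles, key=lambda item: item[1])
    return [audio_ref for audio_ref, _ in ordered]


converted_articles = [
    ("article_b.mp3", date(2020, 3, 2)),
    ("article_a.mp3", date(2020, 3, 1)),
    ("article_c.mp3", date(2020, 3, 3)),
]
playlist = build_playlist(converted_articles)
```

Python's `sorted` is stable, so articles that share a publication date keep their incoming order.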

US Pat. No. 11,089,360

MEDIA CHANNEL IDENTIFICATION WITH VIDEO MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON AUDIO FINGERPRINT

Gracenote, Inc., Emeryvi...


1. A system comprising:
a network communication interface;
at least one processing unit;
non-transitory data storage; and
program instructions stored in the non-transitory data storage and executable by the at least one processing unit to carry out operations including:
determining that digital video fingerprint data representing media content being rendered by a media presentation device matches reference video fingerprint data corresponding with multiple channels,
responsive to at least the determining that the digital video fingerprint data matches the reference video fingerprint data corresponding with the multiple channels, performing disambiguation based at least in part on a determination that digital audio fingerprint data representing the media content being rendered by the media presentation device matches reference audio fingerprint data corresponding with just a single channel of the multiple channels, the disambiguation establishing that the media content being rendered by the media presentation device is media content of the single channel, and
taking action based on the establishing that the media content being rendered by the media presentation device is media content of the single channel.
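
A rough sketch of the disambiguation logic recited above, under simplifying assumptions: fingerprints are modeled as plain strings compared for equality, whereas a real system would use a tolerance-based comparison. The function names and channel labels are hypothetical.

```python
def matching_channels(query_fp, reference_fps):
    """Return the channels whose reference fingerprint equals the query fingerprint."""
    return {channel for channel, fp in reference_fps.items() if fp == query_fp}

def identify_channel(video_fp, audio_fp, video_refs, audio_refs):
    """Use the audio fingerprint only when the video fingerprint matches several channels."""
    video_matches = matching_channels(video_fp, video_refs)
    if len(video_matches) == 1:
        return next(iter(video_matches))
    if len(video_matches) > 1:
        # Multi-match: keep only video-matched channels whose audio also matches.
        audio_matches = matching_channels(audio_fp, audio_refs) & video_matches
        if len(audio_matches) == 1:
            return next(iter(audio_matches))
    return None  # no match, or still ambiguous

# Hypothetical reference data: two channels carry the same video with different audio.
video_refs = {"ch1": "v-same", "ch2": "v-same", "ch3": "v-other"}
audio_refs = {"ch1": "a-english", "ch2": "a-spanish", "ch3": "a-other"}
channel = identify_channel("v-same", "a-spanish", video_refs, audio_refs)
```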


US Pat. No. 11,086,591

METHODS AND APPARATUS FOR DYNAMIC VOLUME ADJUSTMENT VIA AUDIO CLASSIFICATION

GRACENOTE, INC., Emeryvi...


1. An apparatus, comprising:
an audio signal classifier to analyze, with a neural network, a parameter of an audio signal associated with a first volume level to determine a classification group associated with the audio signal;
an input volume detector to determine an input volume of the audio signal;
a volume adjuster to:
determine a classification gain value based on the classification group;
determine an intermediate gain value as an intermediate between the input volume and the classification gain value by applying a first weight to the input volume and a second weight to the classification gain value; and
apply the intermediate gain value to the audio signal, the intermediate gain value to modify the first volume level to a second volume level; and

a dynamic range compressor to apply a compression value to the audio signal, the compression value to modify the second volume level to a third volume level that satisfies a target volume threshold.
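
The weighted intermediate gain can be illustrated with a small arithmetic sketch. It assumes the two weights are applied as a linear weighted sum and that both quantities are expressed on the same scale; the claim does not fix either choice, and the numeric values and function name are hypothetical. The classifier and the dynamic range compressor are not shown.

```python
def intermediate_gain(input_volume, classification_gain, w_input, w_class):
    """Blend the input volume and the classification gain using two weights."""
    return w_input * input_volume + w_class * classification_gain

# Hypothetical values: input volume -20, classification gain -10, weights 0.25 and 0.75.
blended = intermediate_gain(-20.0, -10.0, w_input=0.25, w_class=0.75)
```

With these numbers the blend is 0.25 × (-20) + 0.75 × (-10) = -12.5, which lies between the two inputs as the claim requires of an intermediate value.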

US Pat. No. 11,087,161

METHODS AND SYSTEMS FOR DETERMINING ACCURACY OF SPORT-RELATED INFORMATION EXTRACTED FROM DIGITAL VIDEO FRAMES

Gracenote, Inc., Emeryvi...


1. A method for determining accuracy of sport-related information extracted from a time sequence of digital video frames that represent a sport event, wherein the extracted sport-related information includes an attribute that changes over the time sequence, the method comprising:
detecting by a computing system, based on the extracted sport-related information, a pattern of change of the attribute over the time sequence, wherein the computing system comprises a processing unit and the computing system has access to mapping data that indicates, for each of a plurality of sport events, an expected pattern of change of the attribute;
identifying, by the computing system, the sport event represented by the digital video frames;
determining, by the computing system, the expected pattern of change of the attribute indicated by the mapping data for the identified sport event;
making a determination, by the computing system, of whether the detected pattern of change of the attribute over the time sequence is the expected pattern of change of the attribute indicated by the mapping data for the identified sport event;
if the determination is that the detected pattern of change of the attribute over the time sequence is the expected pattern of change of the attribute indicated by the mapping data of the identified sport event, then, responsive to making the determination, the computing system taking a first action that corresponds to the sport-related information being accurate; and
if the determination is that the detected pattern of change of the attribute over the time sequence is not the expected pattern of change of the attribute indicated by the mapping data for the identified sport event, then, responsive to making the determination, the computing system taking a second action that corresponds to the sport-related information being inaccurate.

US Pat. No. 11,081,101

REAL TIME POPULARITY BASED AUDIBLE CONTENT ACQUISITION

Gracenote Digital Venture...


1. A method comprising:
analyzing, by a computing device, generated playlists for a plurality of client devices, wherein the generated playlists contain ordered references to audio files of media content items respectively customized for the client devices;
determining, by the computing device, that a media content item has a threshold level of popularity based on a number of references to an initial version of an audio file of the media content item that are included in the generated playlists;
responsive to determining that the media content item has the threshold level of popularity, transmitting, by the computing device and to a remote recording station, a request for a human verbal reading of the media content item;
receiving, by the computing device and from the remote recording station, an audio file of the human verbal reading;
updating, by the computing device, the references to the initial version of the audio file of the media content item to refer to the audio file of the human verbal reading; and
transmitting, by the computing device, a playlist to a client device of the plurality of client devices, wherein the playlist includes a plurality of ordered references to audio files of media content items, including the audio file of the human verbal reading.
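
The popularity check and reference update above might look like the following sketch. The playlist contents, reference strings, and threshold are hypothetical, and the request to the remote recording station is omitted.

```python
from collections import Counter

def popular_items(playlists, threshold):
    """Return references that appear at least `threshold` times across all playlists."""
    counts = Counter(ref for playlist in playlists for ref in playlist)
    return {ref for ref, count in counts.items() if count >= threshold}

def swap_references(playlists, old_ref, new_ref):
    """Point every occurrence of the initial audio file at the replacement file."""
    return [[new_ref if ref == old_ref else ref for ref in playlist]
            for playlist in playlists]

# Hypothetical generated playlists for three client devices.
playlists = [["tts/a", "tts/b"], ["tts/a"], ["tts/a", "tts/c"]]
popular = popular_items(playlists, threshold=3)
updated = swap_references(playlists, "tts/a", "human/a")
```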

US Pat. No. 11,080,549

AUTOMATED CROPPING OF IMAGES USING A MACHINE LEARNING PREDICTOR

Gracenote, Inc., Emeryvi...


12. A system comprising:
one or more processors; and
memory storing instructions that, when executed by the one or more processors, cause the system to carry out operations of a machine learning (ML) predictor program, wherein the operations include:
receiving as input to the ML predictor program a first plurality of training raw images, each respective training raw image of the first plurality being associated with a respective set of training master images, each training master image of a given respective set of training master images indicating respective cropping characteristics defined for the associated respective training raw image, wherein the ML predictor program is configured to generate predicted cropping characteristics for any given input image, and wherein cropping characteristics for any particular input image comprise coordinates of cropping boundaries with respect to the particular input image prior to cropping;
training the ML predictor program to predict cropping characteristics for each respective training raw image based on expected cropping characteristics represented in the associated respective set of training master images;
subsequent to training the ML predictor program with the first plurality of training raw images, applying the trained ML predictor program to a second plurality of runtime raw images input to the trained ML predictor program in order to generate for each respective runtime raw image of the second plurality a respective set of runtime cropping characteristics, wherein the respective set of runtime cropping characteristics for each respective runtime raw image comprises one or more subsets of cropping coordinates for the respective runtime raw image, and wherein each subset corresponds to a different cropped version of the respective runtime raw image; and
storing, in non-transitory computer-readable memory, the second plurality of runtime raw images together with the respective set of runtime cropping characteristics for each respective runtime raw image of the second plurality.

US Pat. No. 11,062,127

OBTAINING ARTIST IMAGERY FROM VIDEO CONTENT USING FACIAL RECOGNITION

Gracenote, Inc., Emeryvi...


1. A method comprising:
applying an automated face detection program implemented on a computing device to a first plurality of training digital images associated with a particular television (TV) program to identify a first sub-plurality of the training digital images, each of which contains a single face of a first particular person associated with the particular TV program;
based on a first set of feature vectors determined for the first sub-plurality of training digital images, training a first computational model of a computer-implemented face recognition program for recognizing the first particular person in any given digital image;
applying the face recognition program together with the first computational model to a runtime digital image associated with the particular TV program to recognize the first particular person in the runtime digital image from among one or more faces detected, together with respective geometric coordinates, in the runtime digital image; and
storing, in non-transitory computer-readable memory, the runtime digital image together with information identifying the recognized first particular person and corresponding geometric coordinates of the recognized first particular person in the runtime digital image.

US Pat. No. 11,036,995

METHODS AND SYSTEMS FOR SCOREBOARD REGION DETECTION

Gracenote, Inc., Emeryvi...

1. A method for automatically detecting, in a sequence of video frames, a video frame region that depicts a scoreboard, the method comprising:
receiving, by a computing system, the sequence of video frames, wherein the video frames of the sequence depict image elements including (i) scoreboard image elements that are unchanging across the video frames of the sequence and (ii) other image elements that change across the video frames of the sequence, wherein the computing system comprises a processing unit;
engaging, by the computing system, in an edge-detection process to detect, in the video frames of the sequence, a set of edges of the depicted image elements;
identifying, by the computing system, a subset of the detected set of edges based on each edge of the subset being unchanging across the video frames of the sequence, wherein the edges of the identified subset are candidate edges of the scoreboard image elements;
detecting by the computing system, based on the edges of the identified subset, the video frame region that depicts the scoreboard;
executing, by the computing system, a machine-driven pattern recognition procedure to extract a sport score from the detected video frame region in a given video frame of the sequence;
making a determination, by the computing system, of whether the machine-driven pattern recognition procedure successfully extracted the sport score from the detected video frame region in the given video frame;
if the determination is that the machine-driven pattern recognition procedure successfully extracted the sport score from the detected video frame region in the given video frame, then, responsive to making the determination, the computing system taking a first action that corresponds to the detected video frame region being accurate; and
if the determination is that the machine-driven pattern recognition procedure did not successfully extract the sport score from the detected video frame region in the given video frame, then, responsive to making the determination, the computing system taking a second action that corresponds to the detected video frame region being inaccurate.

US Pat. No. 11,037,603

COMPUTING SYSTEM WITH DVE TEMPLATE SELECTION AND VIDEO CONTENT ITEM GENERATION FEATURE

Gracenote, Inc., Emeryvi...

1. A method comprising:
receiving, by a computing system, a first group of video content items;
determining, by the computing system, a quality score for one or more video content items of the first group of video content items;
identifying, by the computing system, from among the first group of video content items, a second group of video content items based on the second group of video content items (i) having a threshold extent of similarity with each other and (ii) each having a quality score that exceeds a quality score threshold; and
based on the identifying of the second group of video content items, transmitting, by the computing system, at least a portion of at least one video content item of the identified second group of video content items to a digital video-effect (DVE) system, wherein the DVE system is configured for (i) receiving the transmitted at least the portion of the at least one video content item of the identified second group of video content items, (ii) using a number of video content items in the identified second group of video content items to select a DVE template, and (iii) using (a) the received at least the portion of the at least one video content item of the identified second group of video content items and (b) the DVE template, to generate a video content item that includes the received at least the portion of the at least one video content item of the identified second group of video content items.

US Pat. No. 11,024,288

METHODS AND APPARATUS TO SEGMENT AUDIO AND DETERMINE AUDIO SEGMENT SIMILARITIES

GRACENOTE, INC., Emeryvi...

1. A method, comprising:
developing features characterizing beats of audio with a neural network, the neural network trained to develop the features characterizing beats of audio based on an anchor index beat corresponding to a selected beat in training audio;
computing a self-similarity matrix based on the features; and
identifying peaks in audio corresponding to beats of audio based on thresholding a crest factor at points in the self-similarity matrix, the peaks to be mapped to timestamps of the audio.
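
As an illustration of the self-similarity step only, the sketch below computes a cosine-similarity matrix over per-beat feature vectors. The cosine measure, the function name, and the toy features are assumptions; the claim does not name a similarity measure, and the neural-network feature extraction and crest-factor thresholding are not shown.

```python
import math

def self_similarity_matrix(features):
    """Return S where S[i][j] is the cosine similarity of beat i and beat j.

    Assumes every feature vector is nonzero.
    """
    norms = [math.sqrt(sum(x * x for x in vec)) for vec in features]
    size = len(features)
    matrix = [[0.0] * size for _ in range(size)]
    for i in range(size):
        for j in range(size):
            dot = sum(a * b for a, b in zip(features[i], features[j]))
            matrix[i][j] = dot / (norms[i] * norms[j])
    return matrix

# Hypothetical per-beat features; beats 0 and 2 are identical.
beat_features = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]
ssm = self_similarity_matrix(beat_features)
```

Repeated sections of audio show up in such a matrix as high off-diagonal values, here at positions (0, 2) and (2, 0).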

US Pat. No. 11,024,301

MODIFICATION OF ELECTRONIC SYSTEM OPERATION BASED ON ACOUSTIC AMBIENCE CLASSIFICATION

GRACENOTE, INC., Emeryvi...

1. An apparatus comprising:
a noise cancellation module to filter a first audio signal to generate a first filtered audio signal, the first audio signal generated via a first sensor in an environment during presentation of media content in the environment via a media player;
a classifier to:
assign a first classification to the first filtered audio signal based on a first feature of the first filtered audio signal;
assign a second classification to a second audio signal based on a second feature of the second audio signal, the second audio signal generated via the first sensor or a second sensor in the environment during the presentation of the media content; and
instruct the noise cancellation module to filter the second audio signal to cause the noise cancellation module to generate a second filtered audio signal based on the assignment of the second classification to the second audio signal; and
a rules engine to instruct the media player to adjust the media content based on the one of the first filtered audio signal or the second filtered audio signal to generate adjusted media content for presentation in the environment.

US Pat. No. 11,012,738

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON LOCATION

Gracenote, Inc., Emeryvi...

1. A non-transitory computer readable medium having stored thereon instructions executable by a processing unit to cause a computing system to carry out operations comprising:
detecting that digital fingerprint data representing media content being rendered by a media presentation device matches multiple instances of reference fingerprint data each corresponding with a different respective channel;
responsive to the detecting, performing disambiguation based at least in part on a location of the media presentation device, to determine which channel the media presentation device is receiving the media content on, wherein performing the disambiguation comprises, based at least in part on a match between the location of the media presentation device and a location associated with a particular one of the multiple instances of reference fingerprint data, determining that the respective channel with which the particular instance of reference fingerprint data corresponds is the channel that the media presentation device is receiving media content on; and
taking action based on the determined channel.

US Pat. No. 11,012,743

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON SINGLE-MATCH

Gracenote, Inc., Emeryvi...

1. A method comprising:
determining by a computing system that first digital fingerprint data representing first one or more segments of media content being rendered by a media presentation device matches reference fingerprint data corresponding respectively with each of multiple different channels,
after determining that the first digital fingerprint data matches reference fingerprint data corresponding respectively with each of multiple different channels, performing disambiguation based at least in part on an earlier determination that second digital fingerprint data representing second one or more segments of the media content being rendered by the media presentation device matches reference fingerprint data corresponding with just a single channel, the disambiguation establishing that the media presentation device is receiving the media content on the single channel, and
causing action to be taken based on the establishing that the media presentation device is receiving the media content on the single channel.
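
One way to picture the reliance on an earlier single-channel match is a small tracker that remembers the most recent unambiguous result. This is a sketch under stated assumptions: the `ChannelTracker` class and its policy of reusing the last single match only when it is among the current candidates are hypothetical, not taken from the patent.

```python
class ChannelTracker:
    """Remember the most recent unambiguous match and reuse it on a multi-match."""

    def __init__(self):
        self.last_single_match = None

    def update(self, matched_channels):
        matched = set(matched_channels)
        if len(matched) == 1:
            # Unambiguous match: record it for later disambiguation.
            self.last_single_match = next(iter(matched))
            return self.last_single_match
        if len(matched) > 1 and self.last_single_match in matched:
            # Multi-match: fall back on the earlier single-channel determination.
            return self.last_single_match
        return None

tracker = ChannelTracker()
first = tracker.update({"ch5"})          # earlier segment matched one channel
second = tracker.update({"ch5", "ch7"})  # later segment matched two channels
third = tracker.update({"ch8", "ch9"})   # earlier match is not among the candidates
```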

US Pat. No. 10,992,981

DYNAMIC CONTROL OF FINGERPRINTING RATE TO FACILITATE TIME-ACCURATE REVISION OF MEDIA CONTENT

Gracenote, Inc., Emeryvi...

1. A method comprising:
generating query fingerprints representing a media stream being received by a client, wherein the query fingerprints are generated at a frame rate;
providing the generated query fingerprints for use to identify the media stream being received by the client based on comparison with reference fingerprints representing a known media stream;
after identification of the media stream being received by the client, increasing the frame rate of generating the query fingerprints;
providing the query fingerprints generated at the increased frame rate, for use to establish synchronous lock between (i) true-time defined along a timeline within the known media stream and (ii) client-time defined according to a clock of the client; and
using the established synchronous lock as a basis to determine a client-time point at which the client is to perform a content revision of the media stream being received by the client.
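
The claim does not say how synchronous lock is computed. Purely as an illustration, the sketch below assumes each matched fingerprint yields a (client-time, true-time) pair and models the lock as a constant offset estimated by the median difference. All names and timestamps are hypothetical.

```python
from statistics import median

def estimate_offset(matched_pairs):
    """Estimate client_time - true_time from (client_time, true_time) pairs."""
    return median(client - true for client, true in matched_pairs)

def client_time_for(true_time_point, offset):
    """Map a point on the known stream's timeline to the client's clock."""
    return true_time_point + offset

# Hypothetical matches gathered while fingerprinting at the increased frame rate.
pairs = [(100.0, 40.0), (100.5, 40.5), (101.0, 41.0)]
offset = estimate_offset(pairs)
revision_time = client_time_for(55.0, offset)  # revision due at true-time 55.0
```

A higher fingerprint rate yields more closely spaced pairs, which is what allows the mapping to be established with finer time resolution.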

US Pat. No. 10,977,424

COMPUTING DEVICE AND CORRESPONDING METHOD FOR GENERATING DATA REPRESENTING TEXT

Gracenote Digital Venture...

1. A method comprising:
determining, by a server, a set of attributes of a user of a client device;
based on the determined set of attributes, generating, by the server, a sequence of content items, wherein at least one content item in the sequence comprises multiple portions, and wherein at least one of the multiple portions represents text;
preparing the at least one content item, wherein the preparing comprises:
(i) extracting, from the at least one content item of the sequence, at least one of the multiple portions that represents text;
(ii) making a determination that the extracted at least one of the multiple portions representing text has a particular characteristic; and
(iii) based on the determination, modifying the extracted at least one of the multiple portions representing text;
generating data representing a concatenation of (i) the prepared at least one content item of the sequence and (ii) at least one other content item of the sequence; and
providing output based on the generated data.

US Pat. No. 10,979,742

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICE

Gracenote, Inc., Emeryvi...

1. A method comprising:
receiving usage data for a client device that is configured to present media content sequences;
based on the received usage data, selecting, from among a plurality of reference fingerprints, a subset of reference fingerprints;
sending to the client device, the selected subset of reference fingerprints;
receiving from the client device, an indication that the client device has detected a match between (i) one or more of the selected subset of reference fingerprints, and (ii) one or more fingerprints generated by the client device; and
determining, based on the received indication from the client device, an identity of a media sequence being presented on the client device.

US Pat. No. 10,979,758

DYNAMIC CONTROL OF FINGERPRINTING RATE TO FACILITATE TIME-ACCURATE REVISION OF MEDIA CONTENT

Gracenote, Inc., Emeryvi...

1. A method comprising:
receiving, by a computing system, query fingerprints representing a media stream being received by a client, wherein the query fingerprints are generated by the client at a frame rate;
determining by a computing system that the media stream being received by the client is a known media stream, wherein the determining is based on a comparison by the computing system of (i) the query fingerprints generated at the frame rate with (ii) reference fingerprints representing the known media stream;
responsive to the determining, causing by the computing system the client to increase the frame rate at which the client generates the query fingerprints being received by the computing system;
using by the computing system the query fingerprints generated at the increased frame rate as a basis to establish synchronous lock between (i) true time defined along a timeline within the known media stream and (ii) client time defined according to a clock of the client;
using by the computing system the established synchronous lock as a basis to determine a client-time point at which the client should perform ad replacement in the media stream being received by the client; and
causing the client to perform, at the determined client-time point, the ad replacement.

US Pat. No. 10,979,771

DISPLAYING AN ACTIONABLE ELEMENT OVER PLAYING CONTENT

Gracenote, Inc., Emeryvi...

1. A method, comprising:
identifying video content currently playing within a display environment provided by a playback device;
determining a location within the display environment at which to display an actionable element, wherein determining the location comprises:
identifying an area within the display environment in which the playback device displays a navigation element native to the playback device, and
based on the identifying, selecting the identified area within the display environment as the location within the display environment at which to display the actionable element;
in response to determining the location within the display environment at which to display the actionable element, displaying the actionable element at the determined location within the display environment, wherein the actionable element is based on the identified video content and includes a user-selectable option to perform an action associated with the identified video content;
identifying different video content currently playing within the display environment provided by the playback device; and
modifying a configuration of the displayed actionable element based on the identified different video content.

US Pat. No. 10,979,772

METHOD AND SYSTEM FOR PRESENTING ADDITIONAL CONTENT AT A MEDIA SYSTEM

Gracenote, Inc., Emeryvi...

1. A method comprising:
receiving, by a computing system, a query fingerprint for comparison with a plurality of reference fingerprints, wherein the query fingerprint represents media content being received by a client device;
determining, by the computing system, a usage history of the client device, wherein the determining the usage history of the client device comprises identifying a type of media content based on an extent to which the client device has received the type of media content;
using, by the computing system, the determined usage history as a basis to select a first subset of the plurality of reference fingerprints, wherein using the determined usage history as a basis to select the first subset of the plurality of reference fingerprints comprises selecting the first subset of the plurality of reference fingerprints based on the reference fingerprints of the first subset corresponding to the identified type of media content; and
performing, by the computing system, a first fingerprint comparison process comprising:
(i) comparing the query fingerprint to the first subset of the plurality of reference fingerprints;
(ii) determining that the comparison to the first subset does not result in a match between the query fingerprint and one of the reference fingerprints of the first subset; and
(iii) responsive to determining that the comparison to the first subset does not result in a match between the query fingerprint and one of the reference fingerprints of the first subset, performing, by the computing system, a second fingerprint comparison process comprising:
(a) comparing the query fingerprint to a second subset of the plurality of reference fingerprints, wherein the second subset is different from the first subset;
(b) identifying a match between the query fingerprint and one of the reference fingerprints of the second subset; and
(c) using the one of the reference fingerprints of the second subset to identify the media content being received by the client device.
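
The two-stage comparison above can be sketched as a search that tries a usage-history subset first and falls back to a different subset on a miss. Exact string equality stands in for real fingerprint matching, and the subset contents and function names are hypothetical.

```python
def find_match(query_fp, references):
    """Return the content ID of the first reference equal to the query, else None."""
    for content_id, fp in references.items():
        if fp == query_fp:
            return content_id
    return None

def identify(query_fp, first_subset, second_subset):
    """Search the first subset, then the second subset only if no match was found."""
    match = find_match(query_fp, first_subset)
    if match is None:
        match = find_match(query_fp, second_subset)
    return match

# Hypothetical subsets: the first holds the content type the device usually receives.
sports_refs = {"game-1": "fp-aaa", "game-2": "fp-bbb"}
news_refs = {"news-1": "fp-ccc"}
result = identify("fp-ccc", sports_refs, news_refs)
```

Searching the smaller, more likely subset first keeps the common case cheap while the fallback preserves coverage.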

US Pat. No. 10,977,307

METHOD AND APPARATUS FOR MULTI-DIMENSIONAL CONTENT SEARCH AND VIDEO IDENTIFICATION

Gracenote, Inc., Emeryvi...

1. A method comprising:
obtaining, by a computing system, a first query video signature and a second query video signature corresponding to a query video;
identifying, by the computing system, first reference video signatures of a video database that relate to the first query video signature;
identifying, by the computing system, second reference video signatures of the video database that relate to the second query video signature;
combining, by the computing system, the first reference video signatures and the second reference video signatures into a candidate list;
selecting, by the computing system, a reference video signature from the candidate list as a starting point of a sequence, wherein the reference video signature corresponds to a reference frame of a reference video;
determining, by the computing system, a sequence score based on a comparison between a query sequence of the query video and a reference sequence of the reference video;
determining, by the computing system, that the sequence score satisfies a threshold; and
based on the determining that the sequence score satisfies the threshold, outputting, by the computing system, an indication of the reference video.

US Pat. No. 10,970,588

RECURRENT DEEP NEURAL NETWORK SYSTEM FOR DETECTING OVERLAYS IN IMAGES

Gracenote, Inc., Emeryvi...

1. A method comprising:
a processor applying a feature map network to an image to create a feature map, wherein the feature map comprises a grid of vectors characterizing at least one feature in the image;
the processor applying a probability map network to the feature map to create a probability map assigning a probability to the at least one feature in the image, wherein the assigned probability corresponds to a likelihood that the at least one feature is an overlay;
the processor determining that the assigned probability exceeds a threshold probability value; and
responsive to the processor determining that the assigned probability exceeds the threshold probability value, the processor (i) determining compliance with an advertising requirement, (ii) performing optical character recognition (OCR) on the at least one feature, (iii) determining a channel associated with the at least one feature, (iv) determining a show associated with the at least one feature, (v) determining a hyperlink associated with the at least one feature, or (vi) accessing a hyperlinked object associated with the at least one feature.

US Pat. No. 10,972,204

DETECTING AND RESPONDING TO RENDERING OF INTERACTIVE VIDEO CONTENT

Gracenote, Inc., Emeryvi...

1. Non-transitory data storage storing program instructions executable by one or more processors to carry out operations to detect and respond to rendering of video content by a video presentation device, wherein the video content includes (i) a pre-established video segment that does not vary based on user-interaction during the rendering and (ii) a dynamically-defined video segment that varies based on user-interaction during the rendering, the operations comprising:
obtaining a query fingerprint generated in real-time during the rendering as a representation of the video content being rendered, the query fingerprint including a first portion representing the pre-established video segment and a second portion representing the dynamically-defined video segment;
while obtaining the query fingerprint, (a) detecting a match between the first portion of the query fingerprint and a reference fingerprint that represents the pre-established video segment, (b) based on the detecting of the match, identifying the video content being rendered, (c) after identifying the video content being rendered, applying a trained neural network to at least the second portion of the query fingerprint, and (d) detecting, based on the applying of the neural network, that rendering of the identified video content continues; and
responsive to at least the detecting that rendering of the identified video content continues, taking action specific to the identified video content.

US Pat. No. 10,971,190

SYNTHESIZING A PRESENTATION FROM MULTIPLE MEDIA CLIPS

Gracenote, Inc., Emeryvi...

1. A system comprising:
a processor-implemented media ingestion module configured to access a plurality of media clips including a first video clip and a second video clip;
a media analysis module configured to:
match a first fingerprint of at least a part of the first video clip with a second fingerprint of at least a part of the second video clip;
match a first event location of at least a part of the first video clip with a second event location of at least a part of the second video clip; and
determine an overlap of the first video clip with the second video clip based at least in part on the match of the first fingerprint of at least the part of the first video clip with the second fingerprint of at least the part of the second video clip and the match of the first event location of at least the part of the first video clip with the second event location of at least the part of the second video clip; and
a content creation module configured to:
merge the first video clip and the second video clip into a group of overlapping video clips based on the overlap of the first video clip with the second video clip; and
synthesize the group of overlapping video clips into a video presentation.

US Pat. No. 10,972,763

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICE

Gracenote, Inc., Emeryvi...

1. A method for use in connection with a client device that is configured to present media content sequences, the method comprising:
sending, by the client device, usage data for the client device;
responsive to sending the usage data, receiving, by the client device, a subset of reference fingerprints selected from among a plurality of reference fingerprints and selected based on the usage data;
receiving, by the client device, one or more media content sequences;
generating, by the client device, one or more fingerprints of the received one or more media content sequences; and
detecting, by the client device, a match between (i) one or more reference fingerprints in the received subset of reference fingerprints and (ii) the generated one or more fingerprints.

US Pat. No. 10,972,764

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICES

Gracenote, Inc., Emeryvi...

1. A method for use in connection with a client device that is configured to present media content sequences, the method comprising:
receiving, by the client device, one or more reference fingerprints;
receiving, by the client device, one or more media content sequences;
generating, by the client device, one or more fingerprints of the received one or more media content sequences;
detecting that no match exists between (i) the received one or more reference fingerprints and (ii) the generated one or more fingerprints; and
responsive to the detecting, sending, by the client device, to one or more server devices, a request comprising the generated one or more fingerprints for comparison with one or more reference fingerprints stored in association with the one or more server devices.
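
The local-then-remote lookup recited above might be sketched as follows. The function names and the in-memory `SERVER_REFS` table are hypothetical; a real system would send the request over a network, which is replaced here by a plain function call.

```python
from typing import Callable, Iterable, List, Optional, Set, Tuple

# Hypothetical server-side store: fingerprint -> channel label.
SERVER_REFS = {101: "channel-7", 202: "channel-9"}

def fake_server(fingerprints: List[int]) -> Optional[str]:
    """Stand-in for the server comparison (assumption: exact-match lookup)."""
    for fp in fingerprints:
        if fp in SERVER_REFS:
            return SERVER_REFS[fp]
    return None

def resolve(generated: Iterable[int], local_refs: Set[int],
            ask_server: Callable[[List[int]], Optional[str]]) -> Tuple[str, object]:
    """Try the locally held references first; on no match, forward to the server."""
    generated = list(generated)
    matched = [fp for fp in generated if fp in local_refs]
    if matched:
        return ("local", matched)
    # No local match exists: send the generated fingerprints for remote comparison.
    return ("server", ask_server(generated))
```

The design choice illustrated is that the server is contacted only when the client-side comparison fails, which keeps routine matches off the network.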

US Pat. No. 10,972,766

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICE

Gracenote, Inc., Emeryvi...

1. A method for use in connection with a client device that is configured to present media content sequences, the method comprising:
receiving, by the client device, one or more reference fingerprints, wherein the one or more reference fingerprints is associated with channel data for a particular channel;
receiving, by the client device, one or more media content sequences;
generating, by the client device, one or more fingerprints of the received one or more media content sequences;
detecting a match between (i) the received one or more reference fingerprints and (ii) the generated one or more fingerprints; and
responsive to detecting the match, sending, by the client device, to one or more server devices, a message that comprises the channel data.

US Pat. No. 10,972,786

MEDIA CHANNEL IDENTIFICATION AND ACTION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON MATCHING WITH DIFFERENTIAL REFERENCE-FINGERPRINT FEATURE

Gracenote, Inc., Emeryvi...

1. A media presentation device comprising:
a media input interface through which to receive video content to be presented by the media presentation device, wherein the video content includes video frames having video frame regions, the video frame regions comprising a center, an edge, and a corner;
a media presentation interface for presenting the received video content; and
a network communication interface,
wherein the media presentation device is configured to generate first query fingerprint data representing the video content based on analysis of the video content, and to output the generated first query fingerprint data for transmission through the network communication interface to a server,
wherein the media presentation device is configured to receive from the server, after outputting the first query fingerprint data, a request for second query fingerprint data specifically focused on an identified video frame region of the video frame regions of the video content, wherein the identified video frame region defines a difference between multiple channels that each have reference fingerprint data matching the first query fingerprint data,
wherein the media presentation device is configured to output, for transmission through the network communication interface to the server, the requested second query fingerprint data specifically focused on the identified video frame region of the video content, and
wherein the media presentation device is configured to present, in conjunction with the video content that the media presentation device is presenting, supplemental channel-specific content associated with one of the multiple channels, the one channel being identified from among the multiple channels based on a determination that the second query fingerprint data matches a reference fingerprint of just the one channel of the multiple channels.
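
The two-round disambiguation described in this claim can be illustrated with a small sketch. Everything below is assumed for illustration: the reference table, the integer "fingerprints", and the function names are hypothetical, and matching is reduced to exact equality.

```python
REGIONS = ("center", "edge", "corner")

# Hypothetical reference data: channel -> fingerprint of the whole frame and of each region.
REFERENCES = {
    "channel-4": {"frame": 0xA1, "center": 0x10, "edge": 0x20, "corner": 0x31},
    "channel-5": {"frame": 0xA1, "center": 0x10, "edge": 0x20, "corner": 0x32},
    "channel-6": {"frame": 0xB7, "center": 0x44, "edge": 0x55, "corner": 0x66},
}

def matching_channels(first_query, references):
    """Round 1: every channel whose reference matches the first query fingerprint."""
    return sorted(ch for ch, ref in references.items() if ref["frame"] == first_query)

def differing_region(channels, references):
    """Pick a frame region whose reference fingerprints differ for all candidate channels."""
    for region in REGIONS:
        values = {references[ch][region] for ch in channels}
        if len(values) == len(channels):
            return region
    return None

def disambiguate(channels, region, second_query, references):
    """Round 2: succeed only if the region-focused query matches exactly one channel."""
    hits = [ch for ch in channels if references[ch][region] == second_query]
    return hits[0] if len(hits) == 1 else None
```

In this toy data the two candidate channels share the same whole-frame fingerprint and differ only in the corner region (for example, where a channel logo might appear), so the server would request a corner-focused fingerprint.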

US Pat. No. 10,958,966

SYNCHRONIZING STREAMING MEDIA CONTENT ACROSS DEVICES

GRACENOTE, INC., Emeryvi...

1. A method comprising:
determining, by executing an instruction on a first computing device, a first audio fingerprint of audio associated with broadcast media streaming on a second computing device;
identifying, by executing an instruction on the first computing device, the broadcast media streaming to the second computing device based on the first audio fingerprint;
identifying, by executing an instruction on the first computing device, a first source streaming the broadcast media to the second computing device, and one or more sources broadcasting the broadcast media streaming to the second computing device, the one or more sources available to the first computing device;
selecting a second source of the identified one or more sources for streaming the broadcast media to the first computing device, the second source different than the first source;
storing, on the first computing device, a predetermined amount of the audio associated with the broadcast media as the broadcast media is streamed to the second computing device;
detecting, by executing an instruction on the first computing device, termination of the streaming of the broadcast media on the second computing device;
generating, by executing an instruction on the first computing device, a second audio fingerprint associated with the predetermined amount of audio to determine a time when the streaming of the broadcast media was terminated on the second computing device; and
automatically starting, by using the selected second source, streaming of the broadcast media to the first computing device at the determined time when the streaming of the broadcast media was terminated on the second computing device.

US Pat. No. 10,931,390

VEHICLE-BASED MEDIA SYSTEM WITH AUDIO AD AND VISUAL CONTENT SYNCHRONIZATION FEATURE

GRACENOTE, INC., Emeryvi...

1. A vehicle-based media system comprising:
a user interface;
a microphone;
one or more processors; and
a non-transitory, computer-readable medium storing instructions that, when executed by the one or more processors, cause the vehicle-based media system to perform a set of operations comprising:
receiving audio content;
causing one or more speakers to output the received audio content;
determining that the received audio content does not include information indicating musical audio content;
in response to determining that the received audio content does not include information indicating musical audio content, using the microphone of the vehicle-based media system to capture the output audio content;
identifying reference audio content that has at least a threshold extent of similarity with the captured audio content;
identifying visual content based at least on the identified reference audio content; and
outputting, via the user interface of the vehicle-based media system, the identified visual content.

US Pat. No. 10,923,155

SYNTHESIZING A PRESENTATION FROM MULTIPLE MEDIA CLIPS

Gracenote, Inc., Emeryvi...

1. A method comprising:
accessing, by one or more processors of a server computing device, a plurality of media clips including a first video clip and a second video clip;
matching, by the one or more processors, a first fingerprint of at least a part of the first video clip with a second fingerprint of at least a part of the second video clip;
determining, by the one or more processors, a temporal overlap of the first video clip with the second video clip based at least in part on the match of the first fingerprint of at least the part of the first video clip with the second fingerprint of at least the part of the second video clip; and
based on the temporal overlap of the first video clip with the second video clip:
merging, by the one or more processors, the first video clip and the second video clip into a group of temporally-overlapping video clips,
transmitting, by the one or more processors, to a client computing device, data identifying the group of temporally-overlapping video clips and specifying a synchronization of the first video clip with the second video clip, and
generating, by the one or more processors, for display on a display device of the client computing device, a graphical user interface that identifies the group of temporally-overlapping video clips, specifies the synchronization of the first video clip with the second video clip, and allows access to, and manipulation of, the first and second video clips.
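
One way to picture the temporal-overlap determination is to slide one clip's fingerprint sequence along the other and count agreements. The sketch below assumes each clip is represented by one integer fingerprint per second and that a match means equality; both are simplifying assumptions, and `best_offset` and `overlap_seconds` are hypothetical names.

```python
def best_offset(a, b, min_matches=3):
    """Find the alignment where b[j] lines up with a[j + offset] most often.

    Returns (offset, matches), or (None, 0) if no alignment reaches min_matches.
    """
    best = (None, 0)
    for offset in range(-(len(b) - 1), len(a)):
        matches = sum(
            1 for j, fp in enumerate(b)
            if 0 <= j + offset < len(a) and a[j + offset] == fp
        )
        if matches > best[1]:
            best = (offset, matches)
    return best if best[1] >= min_matches else (None, 0)

def overlap_seconds(a, b, offset):
    """Length of the time span covered by both clips at the given offset."""
    start = max(0, offset)
    end = min(len(a), offset + len(b))
    return max(0, end - start)

CLIP_A = [1, 2, 3, 4, 5, 6]   # hypothetical per-second fingerprints
CLIP_B = [4, 5, 6, 7, 8]      # starts three seconds after CLIP_A
```

Here the second clip begins three seconds into the first, so the two would be grouped as temporally overlapping and synchronized with an offset of 3.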

US Pat. No. 10,924,816

METHOD AND APPARATUS FOR SELECTION OF CONTENT FROM A STREAM OF DATA

Gracenote, Inc., Emeryvi...

1. A method comprising:
accessing, by a media device, a media stream that includes a first sequence of items of content corresponding to a first channel and a second sequence of items of content corresponding to a second channel;
accessing, by the media device, a first reference stream defining a sequence of first reference fingerprints corresponding to respective segments of at least some of the items of content in the first sequence;
accessing, by the media device, a second reference stream defining a sequence of second reference fingerprints corresponding to respective segments of at least some of the items of content in the second sequence;
computing, by a processor of the media device, a first computed fingerprint from a particular segment of the first sequence of items of content of the media stream and a second computed fingerprint from a particular segment of the second sequence of items of content of the media stream;
performing, by the processor of the media device, comparisons of (i) the first computed fingerprint to one or more of the first reference fingerprints and (ii) the second computed fingerprint to one or more of the second reference fingerprints; and
deciding, based on the comparisons, whether to render the particular segment of the first sequence of items of content or the particular segment of the second sequence of items of content.

US Pat. No. 10,917,697

INTERACTIVE PROGRAMMING GUIDE

Gracenote, Inc., Emeryvi...

1. A system comprising:
at least one processor; and
a machine-readable medium storing executable instructions which, when executed, cause the at least one processor to perform operations including:
displaying an interactive programming guide to be displayed on a media content device, the interactive programming guide including a personalized lineup, the personalized lineup including a corresponding media content identification assigned to each one of a plurality of sequential time slots, each media content identification identifying media content for a corresponding time slot;
in response to a first selection of a first one of the time slots of the personalized lineup, displaying a plurality of media content category options to be displayed on the media content device while maintaining the display of the personalized lineup, a first media content category option including a plurality of media content options, the plurality of media content options determined by:
determining relevance measurements for the plurality of media content options based on a profile;
ranking the relevance measurements; and
determining a presentation order of the plurality of media content options for the first media content category option based on the ranking; and
in response to a second selection of a first one of the plurality of media content options, modifying the first one of the time slots of the personalized lineup to assign a media content identification corresponding to the first one of the plurality of media content options of the first media content category option to the first one of the time slots.
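
The ranking and slot-assignment steps can be sketched as follows. The relevance measure used here (a sum of profile weights over an option's tags) is an assumption chosen for illustration; the claim does not specify how relevance is computed. The names `rank_options` and `assign_slot` are hypothetical.

```python
def rank_options(options, profile):
    """Order media content options by relevance to a profile.

    options: title -> list of tags; profile: tag -> weight (both hypothetical shapes).
    Ties are broken alphabetically so the order is deterministic.
    """
    relevance = {
        title: sum(profile.get(tag, 0.0) for tag in tags)
        for title, tags in options.items()
    }
    return sorted(relevance, key=lambda title: (-relevance[title], title))

def assign_slot(lineup, slot, title):
    """Return a copy of the personalized lineup with one time slot reassigned."""
    updated = dict(lineup)
    updated[slot] = title
    return updated

PROFILE = {"comedy": 2.0, "drama": 0.5, "sports": 1.0}
OPTIONS = {"Show A": ["drama"], "Show B": ["comedy", "sports"], "Show C": ["sports"]}
```

Selecting the top-ranked option for the 20:00 slot would then be `assign_slot(lineup, "20:00", rank_options(OPTIONS, PROFILE)[0])`.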

US Pat. No. 10,902,043

RESPONDING TO REMOTE MEDIA CLASSIFICATION QUERIES USING CLASSIFIER MODELS AND CONTEXT PARAMETERS

GRACENOTE, INC., Emeryvi...

1. A method for classifying media, the method comprising:
determining, by executing an instruction with one or more processor circuits, a first search scope characteristic associated with a first context parameter received from a remote device, the first search scope characteristic representing a subset of media categories to which a first media query received from the remote device may correspond;
selecting, by executing an instruction with the one or more processor circuits, a first classification model that previously analyzed the first media query to analyze a second media query when a second context parameter associated with the second media query is within a threshold of change with respect to the first context parameter, the first classification model selected from a plurality of classification models based on the first search scope characteristic, ones of the plurality of classification models trained based on training data and tuned for use with specific search scope characteristics;
determining, by executing an instruction with the one or more processor circuits, a media type probability index for the second media query using the first classification model by inputting characteristics corresponding to the second media query into the first classification model, wherein the media type probability index indicates a likelihood that the second media query corresponds to a media category of the subset of media categories; and
transmitting at least one of the media type probability index or the media category to the remote device.

US Pat. No. 10,880,023

VEHICLE-BASED MEDIA SYSTEM WITH AUDIO ADVERTISEMENT AND EXTERNAL-DEVICE ACTION SYNCHRONIZATION FEATURE

GRACENOTE, INC., Emeryvi...

1. A vehicle-based media system comprising:
a network interface for communicating with an external computing device that is distinct from the vehicle-based media system;
a microphone;
one or more processors; and
a non-transitory, computer-readable medium storing instructions that, when executed by the one or more processors, cause the vehicle-based media system to perform a set of operations comprising:
receiving audio content;
causing one or more speakers to output the received audio content;
using the microphone of the vehicle-based media system to capture the output audio content;
identifying reference audio content that has at least a threshold extent of similarity with the captured audio content;
identifying a computational action based at least on the identified reference audio content; and
sending, via the network interface of the vehicle-based media system, an instruction that causes the external computing device to perform the identified computational action.

US Pat. No. 10,860,647

SYSTEMS, METHODS, AND APPARATUS TO IMPROVE MEDIA IDENTIFICATION

GRACENOTE, INC., Emeryvi...

1. An apparatus to improve media identification, the apparatus comprising:
a hash handler to generate a first set of reference matches by performing hash functions on a subset of media data included in a fingerprint associated with media to generate hashed media data, the first set of reference matches based on matching the hashed media data to a hash table based on a first bucket size, ones of the first set of reference matches having a respective quantity of one or more hits to be compared to a threshold;
a candidate determiner to:
identify a second set of reference matches that include one or more reference matches of the first set, the second set including first reference matches having first quantities of hits that do not satisfy the threshold;
determine second quantities of hits for ones of the second set by matching ones of the second set to the hash table based on a second bucket size, the second bucket size different than the first bucket size; and
identify one or more candidate matches based on at least one of (1) ones of the first set having the first quantities of hits that satisfy the threshold or (2) ones of the second set having third quantities of hits that satisfy the threshold, the third quantities of hits based on the first quantities of hits and the second quantities of hits; and
a report generator to generate a report including an identification of the media associated with the fingerprint based on the one or more candidate matches.
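
A loose sketch of the two-pass idea follows: references that fall short of the hit threshold at one bucket size get a second pass at a different bucket size, and the hits from both passes are combined. This is one possible reading only. Here "bucket size" is assumed to mean the quantization step applied to integer fingerprint values, which the claim does not state, and `count_hits` and `candidate_matches` are hypothetical names.

```python
def count_hits(query, refs, bucket):
    """Per reference, count query values that land in a bucket the reference also occupies."""
    hits = {}
    for name, values in refs.items():
        occupied = {v // bucket for v in values}
        hits[name] = sum(1 for q in query if q // bucket in occupied)
    return hits

def candidate_matches(query, refs, first_bucket=4, second_bucket=16, threshold=3):
    """Two-pass candidate selection with a retry at a different bucket size."""
    first = count_hits(query, refs, first_bucket)
    passed = {name for name, h in first.items() if h >= threshold}
    # Second pass only for references that did not satisfy the threshold.
    retry = {name: refs[name] for name in refs if name not in passed}
    second = count_hits(query, retry, second_bucket)
    # Combined hit count (first pass plus second pass) is compared to the threshold.
    passed |= {name for name in retry if first[name] + second[name] >= threshold}
    return sorted(passed)

QUERY = [10, 21, 33, 47]
REFS = {
    "A": [9, 22, 32, 45],       # close to the query: passes at the first bucket size
    "B": [14, 26, 38, 60],      # only matches at the coarser bucket size
    "C": [100, 200, 300, 400],  # unrelated
}
```

With this data, reference A qualifies on the first pass, B qualifies only after the coarser second pass, and C is never a candidate.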

US Pat. No. 10,824,222

DETECTING AND RESPONDING TO AN EVENT WITHIN AN INTERACTIVE VIDEOGAME

Gracenote, Inc., Emeryvi...

1. A method comprising:
detecting, by a machine, that an identifier established from presentation by a device of an interactive videogame matches a reference identifier corresponding with an event within the interactive videogame, wherein the established identifier is selected from the group consisting of a fingerprint of the interactive videogame and a watermark from the interactive videogame, and wherein the event is selected from the group consisting of (i) performing of a particular set of tasks within the game, (ii) gaining of access to a feature of the game, and (iii) acquisition of a virtual item in the game; and
responsive to at least detecting that the established identifier matches the reference identifier corresponding with the event within the interactive videogame, causing presentation of a notification that references the occurrence of the event.

US Pat. No. 10,809,973

PLAYLIST SELECTION FOR AUDIO STREAMING

Gracenote Digital Venture...

1. A computer-implemented method comprising:
receiving, by a server device, references to one or more audio files selected by a user;
storing, by the server device, the references to the one or more audio files in an account of the user;
receiving, by the server device and from a client device, a request for a playlist, wherein the client device is associated with the user, wherein the client device transmitted the request in response to determining that it is in an automobile, and wherein the client device determining that it is in the automobile comprises: (i) determining, from two or more global positioning system (GPS) measurements, that the client device is moving at a speed that exceeds a predetermined threshold speed for at least a pre-determined threshold amount of time, and (ii) confirming, using map data providing locations of roads and highways, that the client device is traveling along a road or highway;
obtaining, by the server device, the playlist, wherein the playlist contains the references to the one or more audio files and a reference to an audio file that was not selected by the user; and
transmitting, by the server device and to the client device, the playlist.
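
The in-automobile determination, items (i) and (ii) above, can be sketched as a sustained-speed check over GPS fixes followed by a map confirmation. The threshold values, the `(timestamp, latitude, longitude)` fix format, and the `on_road` predicate standing in for map data are all assumptions made for this sketch.

```python
import math

def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two latitude/longitude points."""
    r = 6371000.0  # mean Earth radius in meters
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))

def in_automobile(fixes, on_road, min_speed_mps=8.0, min_duration_s=60.0):
    """fixes: time-ordered list of (timestamp_s, lat, lon).

    True once speed has exceeded min_speed_mps for at least min_duration_s
    and the hypothetical map lookup on_road(lat, lon) confirms a road position.
    """
    fast_since = None
    for (t0, la0, lo0), (t1, la1, lo1) in zip(fixes, fixes[1:]):
        if t1 <= t0:
            continue  # skip out-of-order or duplicate timestamps
        speed = haversine_m(la0, lo0, la1, lo1) / (t1 - t0)
        if speed > min_speed_mps:
            if fast_since is None:
                fast_since = t0
            if t1 - fast_since >= min_duration_s and on_road(la1, lo1):
                return True
        else:
            fast_since = None  # speed dropped: restart the timer
    return False

# Hypothetical fixes 30 s apart, moving north at roughly 15 m/s.
DRIVING = [(0, 37.000, -122.0), (30, 37.004, -122.0), (60, 37.008, -122.0), (90, 37.012, -122.0)]
WALKING = [(0, 37.0000, -122.0), (30, 37.0003, -122.0), (60, 37.0006, -122.0), (90, 37.0009, -122.0)]
```

Requiring both conditions avoids treating, for example, a fast train ride or a brief GPS jump as driving.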

US Pat. No. 10,803,119

AUTOMATED COVER SONG IDENTIFICATION

GRACENOTE, INC., Emeryvi...

1. A computerized method comprising:
accessing, using one or more hardware processors, reference audio to be represented by a reference data structure to be generated and stored in a reference database;
generating, using the one or more hardware processors, the reference data structure from the reference audio by at least:
performing a constant Q transform on multiple reference time slices of the reference audio;
binarizing the constant Q transformed reference time slices of the reference audio by, for each constant Q transformed reference time slice, calculating a median value of a range of constant Q transformed reference time slices that encompasses the constant Q transformed reference time slice and binarizing the constant Q transformed reference time slices based on the calculated median value of the range;
performing a two-dimensional Fourier transform on multiple reference time windows within the binarized and constant Q transformed reference time slices of the reference audio to obtain two-dimensional Fourier transforms of the reference time windows; and
sequentially ordering the two-dimensional Fourier transforms of the reference time windows in the reference data structure;
creating, within the reference database, a data association between the reference audio and the generated reference data structure that includes the sequentially ordered two-dimensional Fourier transforms of the reference time windows, the created data association indicating that the reference data structure is an identifier of the reference audio;
accessing, using the one or more hardware processors, metadata associated with the reference audio;
accessing, using the one or more hardware processors, a content source using the metadata to obtain query audio to be represented by a query data structure for comparison to the reference data structure that represents the reference audio;
comparing, using the one or more hardware processors, the query audio to the reference audio based on the query data structure and the reference data structure;
generating, using the one or more hardware processors, a ranking of the query audio based on the comparison; and
in response to the ranking satisfying a threshold, generating, using the one or more hardware processors, a notification based on identifying the query audio as a cover rendition of the reference audio.
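
The median-based binarization step can be illustrated in isolation. The sketch below covers only that step: it does not compute a constant Q transform or a two-dimensional Fourier transform, and it assumes the transformed time slices are already available as lists of magnitudes. Taking the median over all values in a window of neighboring slices is an assumption; the claim leaves the exact range and pooling open.

```python
from statistics import median

def binarize(slices, half_window=1):
    """Binarize each time slice against the median of a surrounding range of slices.

    slices: list of time slices, each a list of magnitudes (assumed already transformed).
    half_window: number of neighboring slices on each side included in the range.
    """
    out = []
    for i, current in enumerate(slices):
        lo = max(0, i - half_window)
        hi = min(len(slices), i + half_window + 1)
        # Median over every value in the range that encompasses slice i.
        m = median(v for s in slices[lo:hi] for v in s)
        out.append([1 if v > m else 0 for v in current])
    return out

SLICES = [[1, 5], [2, 8], [9, 3]]  # hypothetical magnitudes, two bins per slice
```

Using a local median rather than a fixed cutoff makes the binary pattern less sensitive to overall loudness changes across the recording.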

US Pat. No. 10,785,530

DYNAMIC VIDEO OVERLAYS

Gracenote, Inc., Emeryvi...

1. A system comprising:
a display;
memory that stores instructions and a plurality of templates; and
one or more processors configured by the instructions to perform operations comprising:
accessing a video input stream that includes first content from a content provider and second content from an intermediate device, wherein, in a frame of the video input stream, the first content corresponds to a first screen portion of the display and the second content corresponds to a second screen portion of the display;
accessing an identifier of the intermediate device;
after accessing the identifier of the intermediate device, selecting a subset of the plurality of templates based on the identifier of the intermediate device, wherein the subset of the plurality of templates comprises more than one template of the plurality of templates;
after selecting the subset of the plurality of templates, comparing the video input stream to the subset of the plurality of templates to determine that the frame of the video input stream corresponds to an identified template, wherein the identified template indicates that the second screen portion comprises an overlay for displaying the second content from the intermediate device;
accessing the identified template;
responsive to a determination that the frame of the video input stream corresponds to the identified template, generating a video output stream comprising the second content for the second screen portion and third content for the first screen portion, wherein the third content differs from the first content of the video input stream; and
causing the video output stream to be presented on the display,
wherein generating the video output stream comprises:
determining, based on the identified template, a plurality of coordinates relating to a position and a size of the second screen portion, and
while accessing the video input stream, dynamically generating, using the plurality of coordinates, the video output stream with the second content at the second screen portion and the third content at the first screen portion.
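
A toy sketch of the template-driven compositing follows. Frames are modeled as small 2D lists of integers, a template is a dictionary holding a device identifier and the overlay region's coordinates, and template matching is reduced to checking one marker pixel. All of these are assumptions made for illustration, not details taken from the claim.

```python
# Hypothetical templates: region is (x, y, width, height) of the overlay.
TEMPLATES = [
    {"device": "stb-a", "region": (1, 1, 2, 1), "marker": 9},
    {"device": "stb-a", "region": (0, 0, 1, 1), "marker": 7},
    {"device": "stb-b", "region": (2, 2, 1, 1), "marker": 9},
]

def select_templates(templates, device_id):
    """Narrow the template set using the intermediate device's identifier."""
    return [t for t in templates if t["device"] == device_id]

def find_template(frame, subset):
    """Toy matching: a template matches if its marker value sits at the region origin."""
    for t in subset:
        x, y, _, _ = t["region"]
        if frame[y][x] == t["marker"]:
            return t
    return None

def compose(frame, replacement, template):
    """Show replacement content everywhere except the overlay region, which is kept."""
    x, y, w, h = template["region"]
    out = [row[:] for row in replacement]
    for r in range(y, y + h):
        out[r][x:x + w] = frame[r][x:x + w]
    return out

FRAME = [[1, 1, 1, 1], [1, 9, 9, 1], [1, 1, 1, 1]]   # 9s are the set-top box overlay
REPLACEMENT = [[5, 5, 5, 5] for _ in range(3)]       # substitute content
```

The output keeps the intermediate device's overlay visible while the surrounding content is swapped.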

US Pat. No. 10,783,863

MACHINE-CONTROL OF A DEVICE BASED ON MACHINE-DETECTED TRANSITIONS

Gracenote, Inc., Emeryvi...

1. An apparatus comprising:
a classifier to determine an audio characteristic value representative of an audio characteristic in audio;
a transition detector to detect a transition between a music category and a talk category by comparing the audio characteristic value to a threshold value among a set of threshold values, the set of threshold values corresponding to the music category and the talk category; and
a context manager to control a device to initiate extraction of fingerprints, responsive to the detected transition between the music category and the talk category.
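
The transition detector can be pictured as a small state machine with one threshold per category. In the sketch below, the characteristic value is assumed to be a score in [0, 1] where higher means more music-like, and the two thresholds (0.7 and 0.3) are arbitrary illustrative values; the callback standing in for "initiate extraction of fingerprints" is likewise hypothetical.

```python
def detect_transitions(values, to_music=0.7, to_talk=0.3, start="talk", on_transition=None):
    """Report transitions between 'talk' and 'music' using per-category thresholds.

    values: sequence of audio characteristic values (assumed music-likeness scores).
    on_transition: optional callback, e.g. to start fingerprint extraction.
    """
    state = start
    events = []
    for i, v in enumerate(values):
        if state == "talk" and v >= to_music:
            new_state = "music"
        elif state == "music" and v <= to_talk:
            new_state = "talk"
        else:
            continue  # value lies between the thresholds: stay in the current category
        events.append((i, state + "->" + new_state))
        state = new_state
        if on_transition is not None:
            on_transition(i, state)
    return events

SCORES = [0.1, 0.5, 0.8, 0.6, 0.2, 0.4]
```

Using two separated thresholds gives hysteresis, so a score hovering near a single cutoff does not trigger repeated fingerprint extraction.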

US Pat. No. 10,733,985

DIGITAL VIDEO FINGERPRINTING USING MOTION SEGMENTATION

Gracenote, Inc., Emeryvi...

1. A method of processing video to generate fingerprints for motion segmented regions over two or more frames, the method comprising:
differencing two frames using an adaptive threshold to generate a two-frame difference image;
creating a motion history image from the two-frame difference image;
segmenting the motion history image to generate one or more motion segmented regions; and
generating a descriptor and a fingerprint for a selected motion segmented region.
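
The first two steps (adaptive-threshold differencing and the motion history image) can be sketched on tiny grayscale frames. The adaptive threshold used here, the mean plus `k` standard deviations of the absolute differences, is an assumption; the claim does not say how the threshold adapts. Segmentation and descriptor generation are not shown, and the function names are hypothetical.

```python
from statistics import mean, pstdev

def difference_image(prev, curr, k=1.0):
    """Binary two-frame difference using a threshold derived from the frame pair itself."""
    diffs = [[abs(c - p) for p, c in zip(prow, crow)] for prow, crow in zip(prev, curr)]
    flat = [d for row in diffs for d in row]
    threshold = mean(flat) + k * pstdev(flat)  # assumed form of the adaptive threshold
    return [[1 if d > threshold else 0 for d in row] for row in diffs]

def update_history(history, motion, tau=3):
    """Motion history image: moving pixels are set to tau, others decay by one toward 0."""
    return [
        [tau if m else max(0, h - 1) for h, m in zip(hrow, mrow)]
        for hrow, mrow in zip(history, motion)
    ]

FRAME_0 = [[10, 10, 10], [10, 10, 10]]
FRAME_1 = [[10, 10, 10], [10, 60, 10]]  # one pixel changes brightness
```

Pixels with recent motion carry higher values in the history image, so thresholding or labeling connected regions of that image would give the motion segmented regions that the later steps operate on.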