US Pat. No. 9,489,455

METHOD AND APPARATUS FOR MULTI-DIMENSIONAL CONTENT SEARCH AND VIDEO IDENTIFICATION

GRACENOTE, INC., Emeryvi...

1. A method of storing an object detected in a video clip in a multi-dimensional content database, the method comprising:
generating a shape signature using multi-bit histogram bins reduced to single bit histogram bins in sectored rings around
a keypoint of an object detected in a frame of the video clip; and

storing data associated with the object in the video clip at a leaf node of the multi-dimensional content database having
a plurality of leaf nodes, wherein the shape signature is used as an address to access the data associated with the object
in the video clip from the leaf node.

US Pat. No. 9,049,496

MEDIA SOURCE IDENTIFICATION

Gracenote, Inc., Emeryvi...

1. A method comprising:
receiving an analog signal from a device that generated the analog signal as a result of the device playing a stream of media
that is being received by the device from a media source;

generating a representation of the analog signal received from the device that is playing the stream of media, the generating
of the representation of the analog signal being performed by a processor of a machine;

providing the representation of the analog signal to a server that stores a reference representation of the stream of media
as corresponding to an identifier of the media source from which the stream of media is being received by the device;

accessing the identifier of the media source from which the stream of media is being received by the device, the identifier
being provided by the server based on a comparison of the representation of the analog signal to the reference representation
of the stream of media;

detecting a remote control code that, when the detected remote control code is received by the device, causes the device to
request the stream of media from the media source; and

storing the accessed identifier provided by the server as corresponding to the stream of media being received by the device
and as corresponding to the detected remote control code that causes the device to request the stream of media.

US Pat. No. 9,510,057

INTERACTIVE PROGRAMMING GUIDE

Gracenote, Inc., Emeryvi...

1. A computer-implemented method comprising:
accessing a profile;
determining, by a machine having a memory and at least one processor, a personalized lineup based on the profile, the personalized
lineup comprising a corresponding media content identification assigned to each one of a plurality of sequential time slots,
each media content identification identifying media content for the corresponding time slot;

causing a first interactive programming guide to be displayed on a first media content device associated with the profile,
the first interactive programming guide comprising the personalized lineup;

receiving, via the first interactive programming guide, a request to assign a first different media content identification
to a first one of the time slots, the receiving the request to assign the first different media content identification to
the first one of the time slots comprising:

receiving a user selection of the first one of the time slots of the personalized lineup;
determining a plurality of media content category options based on the profile, each one of the plurality of media content
category options identifying a different category of media content;

in response to the received user selection of the first one of the time slots, causing the determined plurality of media content
category options to be displayed on the first media content device while maintaining the display of the first interactive
programming guide on the first media content device;

receiving a user selection of one of the media content category options;
in response to the received user selection of the one of the media content category options, determining a plurality of media
content options based on the profile;

causing the plurality of media content options to be displayed on the first media content device while maintaining the display
of the first interactive programming guide and the plurality of media content category options on the first media content
device; and

receiving a user selection of one of the media content options, the one of the media content options corresponding to the
first different media content identification; and

modifying the first one of the time slots of the personalized lineup in response to the request to assign the first different
media content identification to the first one of the time slots.

US Pat. No. 9,510,044

TV CONTENT SEGMENTATION, CATEGORIZATION AND IDENTIFICATION AND TIME-ALIGNED APPLICATIONS

GRACENOTE, INC., Emeryvi...

11. A method of providing time-aligned services on broadcast non-recorded content, the method comprising:
analyzing by an electronic client device, incoming video of a broadcast live event based on comparisons with opening sequences
of programs stored on the electronic client device to determine program details associated with the broadcast live event;

generating by the electronic client device, fingerprints of selected frames of the incoming video;
searching a reference database using the generated fingerprints to identify a likely match for content of the incoming video
with content of an original video stored in the reference database; and

correcting by the electronic client device, frame alignment in the incoming video based on a scale ratio mapping by utilizing
a scale ratio for an x axis (Sx) and a different scale ratio for a y axis (Sy) between the selected frames of the incoming
video and corresponding frames of a matched original video selected from the reference database and for generating subsequent
fingerprints based on the scale ratio mapping to enable a time aligned service to overlay on aligned video frames for display
on a display of the electronic client device information personalized for the electronic client device.

US Pat. No. 9,465,995

IDENTIFYING VIDEO CONTENT VIA COLOR-BASED FINGERPRINT MATCHING

Gracenote, Inc., Emeryvi...

18. A system, comprising:
a hardware processor-based fingerprint module configured to receive an identification query including a first fingerprint
and a second fingerprint, the first fingerprint being a color-based fingerprint derived from colors in a portion of the unknown
video content, the second fingerprint being at least partially based on a feature other than the colors of the same portion
of the unknown video content;

a hardware processor-based query module configured to perform the following operations responsive to the identification query:
query a reference database of reference fingerprints using one of the first fingerprint to obtain a candidate group of fingerprints
or the second fingerprint to obtain the candidate group of fingerprints, the candidate group of fingerprints being a subset
of the reference fingerprints queried;

query the candidate group of fingerprints using the other of the first fingerprint and the second fingerprint to identify
at least one query fingerprint; and

a hardware processor-based identification module configured to identify the unknown video content using the at least one query
fingerprint.

US Pat. No. 9,356,914

CONTENT-BASED ASSOCIATION OF DEVICE TO USER

Gracenote, Inc., Emeryvi...

1. A method comprising:
accessing, using one or more processors of a user device, first data corresponding to each of a first group of content items
to which access by a first application installed on the user device is confined by a first set of permissions;

accessing, using the one or more processors of the user device, second data corresponding to each of a second group of content
items to which access by a second application installed on the user device and distinct from the first application is confined
by a second set of permissions, the second group of content items and the first group of content items being distinct but
overlapping proper subsets of a third group of content items accessible by the user device;

generating, using the one or more processors of the user device via execution of a first module associated with the first
application and confined by the first set of permissions, a first device identifier of the user device based on the first
data;

generating, using the one or more processors of the user device via execution of a second module distinct from the first module
and associated with the second application and confined by the second set of permissions, a second device identifier of the
user device based on the second data; and

transmitting, using the one or more processors of the user device, the first and second device identifiers of the user device
from the user device to a service device configured to associate the user device with a user identifier of a user based on
a determination that the first and second device identifier both correspond to the user device.

US Pat. No. 9,195,663

MEDIA FINGERPRINTING AND IDENTIFICATION SYSTEM

Gracenote, Inc., Emeryvi...

1. A method for media content identification in multimedia content, the method comprising:
generating an N-dimension reference signature and a compact traversal hash reference signature based on an N-dimension reference
descriptor for a reference media content feature identified in a frame of a reference video sequence;

storing the N-dimension reference signature and a link to the reference media content feature at a location in a reference
database that is addressable by the compact traversal hash reference signature;

generating an N-dimension query signature and a compact traversal hash query signature based on an N-dimension query descriptor
for a query media content feature selected from a monitored video sequence; and

searching the reference database using the compact traversal hash query signature to select reference signatures that are
within a specified signature distance from the N-dimension query signature.

US Pat. No. 9,146,990

SEARCH AND IDENTIFICATION OF VIDEO CONTENT

Gracenote, Inc., Emeryvi...

1. A method, comprising:
receiving a search query that includes a group of query fingerprints, each fingerprint of the group of query fingerprints
being associated with a frame of video content presented at a client device, the frames of video content associated with the
group of query fingerprints being sequential;

accessing an index of reference fingerprints, each reference fingerprint of the reference fingerprints corresponding to a
sequential frame of known video content, each reference fingerprint of the reference fingerprints having an associated first
value calculated from one or more large regions of the corresponding frame of known video content, each reference fingerprint
of the reference fingerprints having an associated second value calculated from one or more small regions of the corresponding
frame of known video content;

comparing the group of query fingerprints to reference fingerprints of the index;
identifying two or more match candidates based on a comparison of the group of query fingerprints to the first values associated
with the reference fingerprints;

selecting one or more of the identified match candidates based on a comparison of the group of query fingerprints to the second
values associated with the identified match candidates; and

identifying the video content presented at the client device based on the selected match candidates.

US Pat. No. 9,143,718

METHOD AND APPARATUS FOR SELECTION OF CONTENT FROM A STREAM OF DATA

Gracenote, Inc., Emeryvi...

1. An apparatus comprising:
a processor configured by a comparator unit to:
receive a main datastream and a reference datastream,
the main datastream including main content elements intermixed with inserted content elements,
the reference datastream having a different data rate than the main datastream and including reference fingerprints of the
main content elements of the main datastream;

compute main fingerprints from at least some of the main content elements included in the main datastream; and
a control unit configured to select the main content elements from the main datastream based on a comparison of the main fingerprints
to the reference fingerprints.

US Pat. No. 9,501,568

AUDIO MATCHING BASED ON HARMONOGRAM

Gracenote, Inc., Emeryvi...

1. A method comprising:
accessing, using one or more processors, audio data that represents query sound to be identified;
creating, using the one or more processors, a spectrogram of the audio data, different segments of the spectrogram representing
amplitudes at frequencies in different time slices of the query sound;

determining, using the one or more processors, a first most dominant frequency in a time slice of the query sound based on
a segment of the spectrogram, the determining including:

calculating a first aggregate energy value of a first candidate frequency based on amplitudes of the first candidate frequency
and harmonics thereof represented in the segment of the spectrogram; and

identifying the first candidate frequency as the first most dominant frequency based on the first aggregate energy value of
the first candidate frequency being a first largest aggregate energy value among aggregate energy values of frequencies whose
amplitudes are represented in the segment of the spectrogram;

determining, using the one or more processors, a second most dominant frequency that indicates a second strongest harmonic
component of the time slice, the determining of the second most dominant frequency including:

calculating a second aggregate energy value of a second candidate frequency based on amplitudes of the second candidate frequency
and harmonics thereof represented in the segment of the spectrogram; and

identifying the second candidate frequency as the second most dominant frequency based on the second aggregate energy value
of the second candidate frequency being a second largest aggregate energy value among the aggregate energy values of the frequencies
whose amplitudes are represented in the segment of the spectrogram;

creating, using the one or more processors, a query harmonogram of the audio data, different segments of the query harmonogram
representing aggregate energy values of dominant frequencies in different time slices of the query sound, the creating of
the query harmonogram including indicating the first and second aggregate energy values in a same segment of the query harmonogram;
and

providing, using the one or more processors, an identifier of the query sound based on a comparison of the query harmonogram
to a reference harmonogram mapped to the identifier by a database.

US Pat. No. 9,160,837

INTERACTIVE STREAMING CONTENT APPARATUS, SYSTEMS AND METHODS

Gracenote, Inc., Emeryvi...

1. A method comprising:
continuously receiving an incoming audio signal including audio content;
processing the audio signal to determine audio characteristics that include a rhythm of the audio content;
classifying the audio content into a classification among multiple classifications that include a music classification and
a talk classification based on the audio characteristics that include the rhythm of the audio content and based on multiple
sets of threshold values that each correspond to a different one of the multiple classifications;

identifying a transition between the music classification and the talk classification by detecting that a change in the rhythm
of the audio content transgresses at least one of the multiple sets of threshold values;

in response to the transition between the music classification and the talk classification being identified by the detecting
that the change in the rhythm of the audio content transgresses at least one of the multiple sets of threshold values, adjusting
at least one of the multiple sets of threshold values; and

selecting a behavior of a device in response to the identified transition between the music classification and the talk classification.

US Pat. No. 9,804,816

GENERATING A PLAYLIST BASED ON A DATA GENERATION ATTRIBUTE

Gracenote Digital Venture...

1. A method comprising:
determining a data generation attribute associated with an audible version of a story, wherein the data generation attribute
indicates that the audible version of the story was generated either by recording a human verbal reading of a text based version
of the story, or by a text-to-speech system operating on the text based version of the story;

based on the data generation attribute, generating a playlist that includes a reference to the audible version of the story;
and

transmitting the playlist to a client device, wherein reception of the playlist causes the client device to:
read the reference from the playlist,
transmit, to a server associated with the reference, a request for the audible version of the story,
receive, from the server, the audible version of the story, and
audibly play out the audible version of the story.

US Pat. No. 9,323,754

MEDIA FINGERPRINTING AND IDENTIFICATION SYSTEM

Gracenote, Inc., Emeryvi...

1. A method for creating a media database, the method comprising:
generating an N-dimension descriptor for a selected image feature in a selected frame of a video clip, the N-dimension descriptor
being determined from a matrix of pixels placed on the selected image feature at {center x, center y} coordinates and having
a spatial extent determined from an x direction scale factor sx and a y direction scale factor sy, wherein the N-dimension
descriptor comprises N digit values arranged in H groups of D dimensions;

applying a first threshold to the N-dimension descriptor to generate an N-dimension signature having N binary values;
generating a combined descriptor based on the N-dimension descriptor by combining the digit values in each group of D dimensions
to create a combined digit value for each group, wherein the combined descriptor comprises H=N/D combined digit values; and

applying a second threshold to each combined digit value of the combined descriptor to generate a traversal hash signature
having H binary values, wherein the traversal hash signature, associated N-dimension signature, associated N-dimension descriptor,
and associated {center x, center y, sx, sy} values are stored in a media database.

US Pat. No. 9,798,715

COMPUTING DEVICE AND CORRESPONDING METHOD FOR GENERATING DATA REPRESENTING TEXT

Gracenote Digital Venture...

1. A method comprising:
accessing first data representing text, wherein the text defines at least two grammatical break positions representing a particular
type of grammatical break between two portions of the text;

identifying, from among the at least two grammatical break positions, a position that is closest to a target position within
the text, wherein the target position is within a range from a first position to a second position, wherein the first position
is positioned after 100 words from a beginning of the text, and wherein the second position is positioned before 130 words
from the beginning of the text;

based on the identified position within the text, generating second data that represents a proper subset of the text, wherein
the proper subset extends from an initial position within the text to the identified position within the text; and

providing output based on the generated second data.

US Pat. No. 9,367,544

METHOD FOR EFFICIENT DATABASE FORMATION AND SEARCH ON MEDIA DEVICES ACTING SYNCHRONOUSLY WITH TELEVISION PROGRAMMING

Gracenote, Inc., Emeryvi...

1. A method for efficiently querying a database on a media device, the method comprising:
forming a reference cache database with reference fingerprints on the media device partitioned for multiple search levels
configured for query searching, wherein each search level represents a larger portion of the reference cache database than
a previous search level;

searching for media content captured on the media device using fingerprints of the captured media content in response to a
local query search in each search level in progressively larger portions of the reference cache database on the media device
with automatic switching between a local search on the media device and a remote search on an external search database upon
not finding reference fingerprints that match fingerprints of the captured media content in any search level portion of the
reference cache database; and

upon finding matching fingerprints in the remote search, downloading and storing a chunk of reference media content fingerprints,
from a neighborhood of matching reference fingerprints on the external search database, in the reference cache database on
the media device, whereby the reference cache database is updated with current search area reference material, wherein the
external search database sends down to one of the search level portions of the media device a chunk of reference fingerprints
for further local querying and media content tracking if the queries for the media content captured on the media device are
found in the external search database and wherein for a second level query, a reference search range is limited to a time
span around a previous successful search match, plus or minus N seconds and wherein an internal second level counter of search
misses is incremented if the second level query is not found in the reference cache database.

US Pat. No. 9,299,364

AUDIO CONTENT FINGERPRINTING BASED ON TWO-DIMENSIONAL CONSTANT Q-FACTOR TRANSFORM REPRESENTATION AND ROBUST AUDIO IDENTIFICATION FOR TIME-ALIGNED APPLICATIONS

Gracenote, Inc., Emeryvi...

1. A method for robust fingerprinting of audio signals in a processor, the method comprising:
structuring a received one dimensional (1D) audio signal into overlapping audio frames;
applying a constant Q-factor transform (CQT) to the overlapping audio frames to generate a two dimensional (2D) CQT data structure
representation of the audio frames;

processing octaves of the 2D CQT data structure to determine regions of interest within the 2D CQT data structure and peak
interest points within selected interest regions;

generating multidimensional descriptors in windows around the peak interest points;
applying a quantizer threshold to the multidimensional descriptors to generate audio signatures representing the received
1D audio signal; and

applying a compacting discrete cosine transform (DCT) on a set of regularly spaced regions generated across one or more octaves
in 2D CQT array frequency axes directions, wherein the multidimensional descriptors are generated with a length based on a
PxQ descriptor box and on generated DCT coefficients.

US Pat. No. 9,479,831

METHOD AND DEVICE FOR GENERATING AND DETECTING A FINGERPRINT FUNCTIONING AS A TRIGGER MARKER IN A MULTIMEDIA SIGNAL

Gracenote, Inc., Emeryvi...

1. A method comprising:
playing back multimedia content on a multimedia playback device, including providing at least some of the multimedia content
on a display associated with the multimedia playback device;

determining, by a processor of the multimedia playback device, a trigger fingerprint from a segment of the multimedia content
being played back on the multimedia playback device;

accessing a plurality of reference fingerprints, each reference fingerprint among the plurality of reference fingerprints
having been previously derived from a respective segment of the multimedia content and associated with at least one corresponding
reference action;

obtaining a match between the trigger fingerprint and a particular reference fingerprint among the plurality of reference
fingerprints;

identifying a particular corresponding reference action associated with the particular reference fingerprint, the particular
corresponding reference action being associated with a time point indicating when, in the multimedia content, the particular
corresponding reference action is to be performed; and

performing, by the processor of the multimedia playback device, the particular corresponding reference action, wherein performance
of the particular reference action causes the multimedia playback device to provide, on the display associated with the multimedia
playback device, information related to the segment of the multimedia content.

US Pat. No. 9,262,421

DISTRIBUTED AND TIERED ARCHITECTURE FOR CONTENT SEARCH AND CONTENT MONITORING

Gracenote, Inc., Emeryvi...

1. A method for fast updating of a search database, the method comprising:
storing signatures of a real time database update in sequential order as received in a buffer on a user device;
sending the signatures from the user device to a remote database without locks; and
updating a remote database with the signatures of the real time database update.

US Pat. No. 9,414,008

METHOD AND APPARATUS FOR SELECTION OF CONTENT FROM A STREAM OF DATA

Gracenote, Inc., Emeryvi...

1. An apparatus comprising:
a processor configured by a comparator unit to:
receive a main datastream and a reference datastream, the main datastream including a plurality of main content elements,
the reference datastream including an item of mark-up information associated with a reference fingerprint;

compute main fingerprints from at least some of the main content elements included in the main datastream; and
link a part of the main datastream to the item of mark-up information based on a comparison between the reference fingerprint
and one or more of the main fingerprints.

US Pat. No. 9,313,359

MEDIA CONTENT IDENTIFICATION ON MOBILE DEVICES

Gracenote, Inc., Emeryvi...

1. A method for audio fingerprinting by using content-based audio feature extraction, the method comprising:
analyzing input audio samples, divided into overlapping frames, to produce windowed audio frame samples for each overlapped
frame;

computing a fast Fourier transform (FFT) for the windowed audio frame samples which FFT results are filtered by a filter-bank
on the spectral components to produce a set of filter bank output coefficients;

computing a log function and a square root function of each set of filter bank output coefficients to produce log and square
root output coefficients;

applying a discrete cosine transform (DCT) separately to the log and square root output coefficients to produce two sets of
DCT spectral components; and

applying a temporal multi-tap finite impulse response (FIR) smoothing derivative filter to the two sets of DCT spectral components
to produce two separate primary descriptors, wherein values in the two separate primary descriptors are sampled to produce
two primary signatures.

US Pat. No. 9,106,804

SYNTHESIZING A PRESENTATION OF A MULTIMEDIA EVENT

Gracenote, Inc., Emeryvi...

1. A media synchronization system to synthesize a presentation of a multimedia event, the system comprising:
a media ingestion module configured to access a plurality of media clips received from a plurality of client devices that
include a first client device from which a first media clip with a first audio waveform is received and a second client device
from which a second media clip with a second audio waveform is received;

processing circuitry configured by a media analysis module to determine a temporal overlap of the first media clip received
from the first client device with the second media clip received from the second client device, the temporal overlap of the
first and second media clips being determined by matching a first fingerprint of at least part of the first media clip with
a second fingerprint of at least part of the second media clip; and

a content creation module configured to merge the first media clip and the second media clip into a group of overlapping media
clips based on the temporal overlap determined by the matching of the first fingerprint with the second fingerprint, and to
generate the presentation to include the group formed by merging the first and second media clips based on the temporal overlap.

US Pat. No. 9,066,114

METHOD AND DEVICE FOR GENERATING AND DETECTING A FINGERPRINT FUNCTIONING AS A TRIGGER MARKER IN A MULTIMEDIA SIGNAL

Gracenote, Inc., Emeryvi...

1. A method comprising:
determining a plurality of trigger fingerprints from content being played back on a playback device, each trigger fingerprint
identifying a corresponding trigger time point of a sequence of trigger time points in the content and at which trigger time
point at least one corresponding action is to be triggered, the determining being performed by a processor;

accessing a database that includes a plurality of reference fingerprints, previously derived from the content, and a plurality
of reference actions, each reference fingerprint identifying at least one corresponding reference action to be performed at
a reference time point identified by the reference fingerprint;

identifying the corresponding reference action by obtaining a match in the database between a trigger fingerprint among the
plurality of trigger fingerprints and a reference fingerprint among the plurality of reference fingerprints; and

performing a reference action that corresponds to the reference fingerprint on the playback device.

US Pat. No. 9,286,902

AUDIO FINGERPRINTING

Gracenote, Inc., Emeryvi...

1. A method comprising:
generating a spectral representation of a segment of audio data, the spectral representation indicating energy values for
a set of frequencies;

multiplying each energy value by a corresponding weight factor determined based on an ordinal position of a corresponding
frequency within the set of frequencies;

using a processor, generating a sparse vector that contains a zero value for each frequency in the set of frequencies except
for representing a first group of highest energy values from a first portion of the set of frequencies with a common value
and representing a second group of highest energy values from a second portion of the set of frequencies with the common value,
the first group being determined based on ranked energy values for frequencies above a threshold frequency, the second group
being determined based on ranked energy values for frequencies below the threshold frequency;

generating an ordered set of permutations of the sparse vector, each permutation in the ordered set of permutations being
generated in a corresponding manner that repositions instances of the common value to permutate the sparse vector;

generating an ordered set of numbers from the ordered set of permutations of the sparse vector, each number in the ordered
set of numbers representing a corresponding permutation by indicating a position of an instance of the common value within
the corresponding permutation; and

generating a fingerprint of the segment of the audio data based on the ordered set of numbers generated from the ordered set
of permutations of the sparse vector.

US Pat. No. 9,495,451

IDENTIFYING VIDEO CONTENT VIA FINGERPRINT MATCHING

Gracenote, Inc., Emeryvi...

1. A computer-implemented method, comprising:
accessing a query fingerprint of a single video frame of video content captured at a client device, the query fingerprint
being generated based on a first value of a first spatial region within the single video frame of video content, the first
value of the first spatial region being calculated by subtracting a spatial sub-region of the first spatial region from one
or more other spatial sub-regions of the first spatial region;

querying a database of reference fingerprints associated with video frames of known video content, each of the reference fingerprints
being generated based on spatial regions within a different single video frame of the known video content;

determining that the query fingerprint matches at least one of the reference fingerprints; and
identifying the video content captured at the client device based on the determination that the query fingerprint matches
at least one of the reference fingerprints.

US Pat. No. 9,798,509

USE OF AN ANTICIPATED TRAVEL DURATION AS A BASIS TO GENERATE A PLAYLIST

GRACENOTE DIGITAL VENTURE...

1. A method comprising:
determining, by a server device, an anticipated travel duration of a client device;
based on the anticipated travel duration, selecting, by the server device, a playlist template from a plurality of playlist
templates, wherein the selected playlist template includes a sequence of audio content placeholders, wherein the placeholders
in the sequence are associated with respective quantity tags and respective type tags, wherein the quantity tags specify integer
numbers of media files to be selected for the associated placeholder, wherein the type tags specify media content types of
the media files to be selected for the associated placeholder, and wherein a duration of the playlist template is within a
threshold of the anticipated travel duration;

generating, by the server device, a playlist from the playlist template, wherein generating the playlist comprises, for each
placeholder in the sequence, (i) selecting one or more media files as specified by the quantity tag and the type tag, respectively,
of the placeholder, and (ii) including, in the playlist, references to audible versions of the one or more media files selected;
and

transmitting the generated playlist to the client device.

US Pat. No. 9,323,841

METHOD AND APPARATUS FOR MULTI-DIMENSIONAL CONTENT SEARCH AND VIDEO IDENTIFICATION

Gracenote, Inc., Emeryvi...

1. A method of storing a video clip in a multi-dimensional content database, the method comprising:
partitioning a multi-dimensional content database by a plurality of pivots, wherein each pivot is a center point around which
corresponding vector signatures are organized and wherein each pivot is coupled to a plurality of leaf nodes;

generating a multi-dimensional vector signature from pre-specified and measurable characteristics that are extracted from
a region of interest in a frame of a video clip having multi-dimensional content and that exceed a pre-specified threshold,
wherein the multi-dimensional vector signature includes a compare pivot component that supports traversal from a pivot selected
from the plurality of pivots to a leaf node selected from a plurality of associated leaf nodes;

storing data associated with the video clip at the selected leaf node of the multi-dimensional content database wherein the
selected pivot is coupled to the plurality of associated leaf nodes including the selected leaf node; and

accessing the data associated with the video clip utilizing the multi-dimensional vector signature to select a pivot by matching
the compare pivot component to the selected pivot and having an address to the data stored in the selected leaf node of the
multi-dimensional content database.

US Pat. No. 9,760,336

USE OF AN ANTICIPATED TRAVEL DURATION AS A BASIS TO GENERATE A PLAYLIST

GRACENOTE DIGITAL VENTURE...

1. A method comprising:
determining, by a server device, an anticipated travel duration of a client device;
based on the anticipated travel duration, selecting, by the server device, a playlist template from a plurality of playlist
templates, wherein the selected playlist template includes a sequence of audio content placeholders, wherein the placeholders
in the sequence are associated with respective quantity tags and respective type tags, wherein the quantity tags specify integer
numbers of media files to be selected for the associated placeholder, wherein the type tags specify media content types of
the media files to be selected for the associated placeholder, and wherein a duration of the playlist template is within a
threshold of the anticipated travel duration;

generating, by the server device, a playlist from the playlist template, wherein generating the playlist comprises, for each
placeholder in the sequence, (i) selecting one or more media files as specified by the quantity tag and the type tag, respectively,
of the placeholder, and (ii) including, in the playlist, references to audible versions of the one or more media files selected;
and

transmitting the generated playlist to the client device.

US Pat. No. 9,372,531

DETECTING AN EVENT WITHIN INTERACTIVE MEDIA INCLUDING SPATIALIZED MULTI-CHANNEL AUDIO CONTENT

Gracenote, Inc., Emeryvi...

1. A method comprising:
accessing a first fingerprint generated from media content that is presentable as part of an interactive media presentation
within which an event that corresponds to the media content is configured to occur in response to a user input, the media
content including spatialized multi-channel audio content that indicates a virtual directional orientation towards which a
player avatar is facing within a virtual world;

generating a second fingerprint from a playback of the media content as part of the interactive media presentation;
detecting, using at least one processor of a machine, an occurrence of the event within the interactive media presentation
based on the second fingerprint matching the first fingerprint; and

presenting a notification that references the occurrence of the event within the interactive media presentation detected based
on the second fingerprint matching the first fingerprint and includes information that corresponds to the virtual directional
orientation within the virtual world.

US Pat. No. 9,323,840

VIDEO FINGERPRINTING

Gracenote, Inc., Emeryvi...

1. A method comprising:
accessing a query fingerprint that includes a first component based on a plurality of patches of a first frame of a set of
frames of video content captured at a client device, the first component composed of bits of data, the bits of data representing
at least two patches of the plurality of patches;

querying a database of reference fingerprints associated with frames of known video content, each reference fingerprint having
a plurality of components, each component of the plurality of components of each reference fingerprint based on a corresponding
frame of known video content;

determining that the query fingerprint matches at least one of the reference fingerprints, a number of frames in the set of
frames of video content on which the query fingerprint is based being fewer than a number of frames used to generate the at
least one matching reference fingerprint;

identifying, by a processor of a machine, the video content captured at the client device based on the determination that
the query fingerprint matches at least one of the reference fingerprints; and

sending an identifier to the client device, the identifier corresponding to the identified video content.

US Pat. No. 10,031,899

COMPUTING DEVICE AND CORRESPONDING METHOD FOR GENERATING DATA REPRESENTING TEXT

Gracenote Digital Venture...

1. A method comprising:accessing first data representing text;
identifying a term within the represented text;
using the identified term as a basis to select a text-editing rule from among a set of text-editing rules;
generating second data that represents the represented text edited in accordance with the selected text-editing rule; and
providing output based on the generated second data.

US Pat. No. 9,380,383

MODIFYING PLAYBACK OF CONTENT USING PRE-PROCESSED PROFILE INFORMATION

Gracenote, Inc., Emeryvi...

1. A method, comprising:
accessing, at a computer system, a stream of media content to be delivered via a communication network to a playback device
configured to play the media content;

identifying, using the computer system, a digital fingerprint for a piece of media content within the stream of media content
to be delivered to the playback device;

determining, using the computer system, profile information for the identified piece of media content, the profile information
identifying playback frequency characteristics of the piece of media content; and

providing the digital fingerprint and the determined profile information to the playback device independently of the stream
of media content, the digital fingerprint and the profile information to allow the playback device to identify the piece of
media content and automatically modify the playback frequency characteristics during playback of the piece of media content
based on the profile information.

US Pat. No. 9,407,962

METHOD AND DEVICE FOR GENERATING AND DETECTING A FINGERPRINT FUNCTIONING AS A TRIGGER MARKER IN A MULTIMEDIA SIGNAL

Gracenote, Inc., Emeryvi...

1. A method comprising:
accessing a database that associates a first fingerprint with a first action, the first fingerprint being for a first trigger
time point of a plurality of trigger time points within a multimedia signal, each trigger time point of the plurality of trigger
time points corresponding to a segment of a plurality of segments of the multimedia signal;

receiving a second trigger time point of the plurality of trigger time points and a second action;
deriving, by a processor of a machine, a second fingerprint based on a segment of the multimedia signal that corresponds to
the second trigger time point; and

updating the database to associate the second fingerprint with the second action.

US Pat. No. 9,798,513

AUDIO CONTENT FINGERPRINTING BASED ON TWO-DIMENSIONAL CONSTANT Q-FACTOR TRANSFORM REPRESENTATION AND ROBUST AUDIO IDENTIFICATION FOR TIME-ALIGNED APPLICATIONS

GRACENOTES, INC., Emeryv...

1. A method for robust fingerprinting of an audio signal in a processor, the method comprising:
applying a constant Q-factor transform (CQT) to an audio frame selected from a sequence of overlapping audio frames to generate
a two dimensional (2D) CQT data structure of CQT coefficients;

establishing descriptor generation boxes each with a spatial extent of PĂ—Q in regular intervals across a selected number of
octaves of the 2D CQT data structure, wherein P is an extent in a first dimension and Q is an extent in a second dimension;

applying a discrete cosine transform (DCT) to each of the descriptor generation boxes to produce transformed coefficient boxes
comprising transformed DCT coefficients; and

generating descriptors for each of the transformed coefficient boxes to represent the selected audio frame, wherein each descriptor
has a length based on a selected number of the transformed DCT coefficients.

US Pat. No. 9,560,102

MEDIA SOURCE IDENTIFICATION

Gracenote, Inc., Emeryvi...

1. A method comprising:
detecting a control code communicated from a controller device to a player device, the control code causing the player device
to play a media stream available from a media source;

accessing an analog signal produced by the player device in playing the media stream from the media source;
generating a representation of the analog signal;
obtaining an identifier of the media source based on a comparison of the generated representation of the analog signal to
a reference representation of the media stream; and

assigning the identifier of the media source to the control code that was communicated to the player device from the control
device and that caused the player device to play the media stream available from the media source identified by the identifier.

US Pat. No. 9,436,689

DISTRIBUTED AND TIERED ARCHITECTURE FOR CONTENT SEARCH AND CONTENT MONITORING

Gracenote, Inc., Emeryvi...

1. A method for fast updating of a search database, the method comprising:
creating two duplicate databases using an initial set of reference signatures;
searching on a first database of the two duplicate databases as the active database;
updating a second database of the two duplicate databases as the standby database with new signatures for new content without
locks;

switching the standby database with the active database to create a new standby database; and
updating the new standby database with the new signatures for the new content without locks.

US Pat. No. 9,396,393

DIGITAL VIDEO CONTENT FINGERPRINTING BASED ON SCALE INVARIANT INTEREST REGION DETECTION WITH AN ARRAY OF ANISOTROPIC FILTERS

Gracenote, Inc., Emeryvi...

1. A method for interest point descriptor and signature generation used in a video sequence fingerprinting process, the method
comprising:
generating interest point parameter vectors for selected frames of a video sequence having a plurality of video frames, wherein
each generated vector contains a set (sx, sy, x, y, pixel maximum value), wherein x, y are coordinates of interest point location, sx is a scale parameter in the x direction, and sy is a scale parameter in the y direction representing a scale pair (sxq, syr) at the interest point (x,y) location for a Q by R scale neighborhood, wherein Q and R are integers, xq?{x0,x1, . . .x(Q?1)},
yr?{y0,y1, . . .y(R?1), and the pixel maximum value is determined from a sorting of local pixel maximum vales according to
their magnitudes;

establishing for each generated vector a rectangular box encompassing a spatial extent, in pixels, of Nsx by Msy and centered at the interest point (x, y) location assciated with the set (sx, sy, x, y, pixel maximum value), wherein N and M are multiplication factors which determine the size in pixel of a neighborhood
around the interest point; and

determining for each generated vector a multi-dimensional descriptor and a multi-dimensional signature based on pixels in
the rectangular box, wherein a collection of multi-dimensional descriptors and multi-dimensional signatures for the generated
vectors of each selected frame of the video sequence comprises a compact representation of the video sequence used in the
video sequence fingerprinting process.

US Pat. No. 10,019,225

AUDIO STREAMING BASED ON IN-AUTOMOBILE DETECTION

Gracenote Digital Venture...

1. A method comprising:receiving, by a client device, an operation that indicates that one or more references to audio files are to be saved for in-automobile playout;
transmitting, by the client device, the one or more references saved for in-automobile playout to a profile server device, wherein reception of the one or more references saved for in-automobile playout causes the profile server device to generate a playlist including the one or more references saved for in-automobile playout;
determining, by the client device, that the client device is in an automobile;
in response to determining that the client device is in the automobile, retrieving, by the client device and from the profile server device, the playlist including the one or more references saved for in-automobile playout;
based on the client device being in the automobile and having retrieved the playlist including the one or more references saved for in-automobile playout, requesting, by the client device, a stream of the audio files from one or more audio server devices;
receiving, by the client device, the stream of the audio files from the one or more audio server devices; and
causing, by the client device, audible playout of the stream of the audio files.

US Pat. No. 9,213,747

SYSTEMS, METHODS, AND APPARATUS FOR GENERATING AN AUDIO-VISUAL PRESENTATION USING CHARACTERISTICS OF AUDIO, VISUAL AND SYMBOLIC MEDIA OBJECTS

Gracenote, Inc., Emeryvi...

1. A method of creating an audio-visual presentation, the method comprising:
identifying a first plurality of media object characteristics associated with a first media object to be presented by a media
player device;

determining a congruency, while the audio visual presentation is being presented, the congruency being between the first plurality
of media object characteristics and a second plurality of media object characteristics associated with a second media object,
the first media object including audio data and the second media object including visual data which are simultaneously presented,
the determining of the congruency including accessing a first score from a first correlation matrix that correlates a first
media object characteristic among the first plurality of media object characteristics to a second media object characteristic
among the second plurality of media object characteristics, the first score representing a first degree of congruency between
the first media object characteristic and the second media object characteristic, the determining of the congruency further
including accessing a second correlation matrix that includes a second score that represents a second degree of congruency
between the first media object and a non-media entity, the determining of the congruency being based on the first score transgressing
a threshold value;

identifying the second media object from a plurality of reference media objects based on the determined congruency between
the first plurality of media object characteristics and the second plurality of media object characteristics and based on
the second degree of congruency between the first media object and the non-media entity; and

using a processor, dynamically generating the audio-visual presentation to synchronously present the first media object and
the second media object on the media player device.

US Pat. No. 9,652,672

DIGITAL VIDEO CONTENT FINGERPRINTING BASED ON SCALE INVARIANT INTEREST REGION DETECTION WITH AN ARRAY OF ANISOTROPIC FILTERS

GRACENOTE, INC., Emeryvi...

1. A method of image scale-space analysis that applies an array of anisotropic filters for interest region determination,
the method comprising:
forming an array of elliptic-shaped anisotropic second order partial derivative Gaussian filters with a first scale parameter
in the x direction (sx) and a second scale parameter in the y direction (sy) as scale parameters (sx, sy);

forming an array of rectangular-shaped anisotropic second order partial derivative Gaussian filters with finite spatial support
according to the (sx, sy) scale parameters to approximate the elliptic-shaped anisotropic second order partial derivative Gaussian filters;

analyzing scale space for a selected image frame by the array of rectangular-shaped anisotropic second order partial derivative
Gaussian filters of finite spatial support to determine at least one interest point corresponding to different (sx, sy) scale parameters, wherein an interest point identifies a location of a distinct image region in the selected image frame;
and

convolving a region of pixels in a surrounding neighborhood of the determined at least one interest point with the array of
rectangular anisotropic second order partial derivative Gaussian filters to produce at least one interest point result that
is resistant to image distortion.

US Pat. No. 10,019,416

COMPUTING DEVICE AND CORRESPONDING METHOD FOR GENERATING DATA REPRESENTING TEXT

Gracenote Digital Venture...

1. A method comprising:accessing, by a computing system, first data defining multiple portions of a news article, wherein the multiple portions of the news article comprise at least one of a plurality of portions including text, an image, or a hyperlink;
selecting, by the computing system, from the plurality of portions including text, a subset of the portions including text, wherein the selecting is based on each portion of the selected subset having a particular characteristic, wherein each portion of the selected subset having the particular characteristic comprises each portion of the selected subset representing text and each portion of the selected subset including content from at least one section from a predefined set of sections of the news article, and further wherein the content from the at least one section of the predefined set of sections is suitable for inclusion in an audible news story;
based on the text included in the portions of the selected subset, generating, by the computing system, text-based second data that includes a concatenation of the text included in the portions of the selected subset; and
providing, by the computing system, output based on the generated text-based second data.

US Pat. No. 9,646,007

DISTRIBUTED AND TIERED ARCHITECTURE FOR CONTENT SEARCH AND CONTENT MONITORING

GRACENOTE, INC., Emeryvi...

1. A method for content searching, the method comprising:
generating fingerprints, in a user device, for a unit of multimedia content;
searching for the generated fingerprints in a found query (FQ) cache and in an unidentified query (UIQ) cache;
searching in a remote reference database if the generated fingerprints are not found in the FQ cache or in the UIQ cache;
and

storing the unit of multimedia content and the generated fingerprints in the UIQ cache upon not finding the generated fingerprints
in the remote reference database, wherein subsequent searches of the unit of multimedia content are found in the UIQ cache
without searching the remote reference database.

US Pat. No. 10,198,243

AUDIO STREAMING BASED ON IN-AUTOMOBILE DETECTION

Gracenote Digital Venture...

1. An in-automobile audio system comprising:a speaker configured to produce audio output;
a communication interface configured to access a wide-area network;
a processor; and
memory containing program instructions that, when executed by the processor, cause the in-automobile audio system to:
transmit, by way of the communication interface and to a profile server device, a request for a playlist, wherein the playlist contains references to audio files, the references previously stored on the profile server device as part of a save-to-automobile operation;
receive, by way of the communication interface and from the profile server device, the playlist;
based on the references in the playlist, transmit, by way of the communication interface and to an audio server device, a request for streaming of one or more of the audio files;
receive, by way of the communication interface and from the audio server device, a stream of the requested audio files; and
play out, by way of the speaker, the requested audio files.

US Pat. No. 9,876,982

TEXT DETECTION IN VIDEO

Gracenote, Inc., Emeryvi...

1. A computer-implemented method comprising:
identifying, by a machine having a memory and at least one processor, a portion of video content as having text, the identifying
the portion of the video content comprising:

performing a connected component analysis on a frame of the video content to detect connected components within the frame;
merging the connected components into a plurality of text lines;
refining the plurality of text lines using horizontal and vertical projections in order to remove one or more text lines from
the plurality of text lines;

filtering out at least one of the plurality of text lines based on a size of the at least one of the plurality of text lines
to form a filtered set of text lines;

binarizing the filtered set of text lines formed by the filtering out of the at least one of the plurality of text lines;
and

filtering out at least one of the text lines from the binarized filtered set of text lines based on at least one of a shape
of components in the at least one of the text lines and a position of components in the at least one of the text lines to
form the portion of the video content having text

identifying the text within the identified portion of the video content;
determining a category for the identified text;
determining additional information corresponding to the video content based on the identified text and the determined category;
and

causing a software application on a media content device to perform a function using the additional information, the function
corresponding to the determined category.

US Pat. No. 9,877,079

INTERACTIVE PROGRAMMING GUIDE

Gracenote, Inc., Emeryvi...

1. A computer-implemented method comprising:
causing a first interactive programming guide to be displayed on a first media content device, the first interactive programming
guide comprising a personalized lineup, the personalized lineup comprising a corresponding media content identification assigned
to each one of a plurality of sequential time slots, each media content identification identifying media content for the corresponding
time slot;

receiving, via the first interactive programming guide, a request to assign a first different media content identification
to a first one of the time slots, the receiving the request to assign the first different media content identification to
the first one of the time slots comprising:

receiving a user selection of the first one of the time slots of the personalized lineup;
in response to the received user selection of the first one of the time slots, causing a plurality of media content category
options to be displayed on the first media content device while maintaining the display of the first interactive programming
guide on the first media content device, each one of the plurality of media content category options identifying a different
category of media content;

receiving a user selection of one of the media content category options;
in response to the received user selection of the one of the media content category options, determining a plurality of media
content options;

causing the plurality of media content options to be displayed on the first media content device while maintaining the display
of the first interactive programming guide and the plurality of media content category options on the first media content
device; and

receiving a user selection of one of the media content options, the one of the media content options corresponding to the
first different media content identification; and

modifying the first one of the time slots of the personalized lineup in response to the request to assign the first different
media content identification to the first one of the time slots.

US Pat. No. 10,057,638

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON LOCATION

Gracenote, Inc., Emeryvi...

1. A method comprising:determining, by a computing system, that a digital fingerprint of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
responsive to determining that the digital fingerprint matches the multiple reference fingerprints each corresponding with a different respective channel, performing by the computing system disambiguation based at least in part on a location of the media presentation device, to determine which channel the media presentation device is receiving the media content on, wherein performing the disambiguation comprises, based at least in part on a match between the location of the media presentation device and a location associated with a particular one of the multiple reference fingerprints, determining that the respective channel with which the particular reference fingerprint corresponds is the channel that the media presentation device is receiving media content on; and
taking action based on the determined channel.

US Pat. No. 9,596,490

AUTHORIZING DEVICES BASED ON IDENTIFYING CONTENT DISTRIBUTOR

Gracenote, Inc., Emeryvi...

1. A system comprising:
a set of watching stations each configured to:
access a video stream being distributed by a content distribution system among multiple content distribution systems, the
video stream including same video content provided by a single content source to the multiple content distribution systems
and including distributor-specific content that is unique to the content distribution system that distributes the video stream,
the video stream being available from the content distribution system of the multiple content distribution systems to a client
device;

generate a set of reference fingerprints of the distributor-specific content included within the video stream accessed from
the content distribution system; and

associate the set of reference fingerprints with the content distribution system that distributes the video stream;
an identification module that is configured to:
access a set of query fingerprints received from the client device, a set of query fingerprints being calculate from at least
a portion of the distributor-specific content playing at the client device;

query a reference database that includes the sets of reference fingerprints to identify a set of reference fingerprints that
corresponds to the set of query fingerprints; and

identify the content distribution system among the multiple content distribution systems by determining a time delay between
a first time associated with a transmission of a frame of video content from the single content source to the content distribution
system and a second time associated with a presentation of the frame of video content by the content distribution system via
the client device;

a location module that is configured to identify a location of the client device; and
an action module that is configured to perform an action dependent upon the identifying of the content distribution system
among the multiple content distribution systems and the location of the client device.

US Pat. No. 10,048,931

MACHINE-LED MOOD CHANGE

Gracenote, Inc., Emeryvi...

1. A system comprising:a biometric sensor to detect a biometric measurement of a user; and
a media selector machine including:
a play count accessor to access play counts that quantify playback of media files;
a media analyzer to:
access multi-dimensional user state scores of the media files; and
generate a distribution of the play counts over the multi-dimensional user state scores;
a target selector to:
determine a current multi-dimensional user state score of the user based on the biometric measurement of the user; and
select a target multi-dimensional user state score based on the current multi-dimensional user state score; and
a media selector to select a media file for playback to the user based on the distribution of the play counts and the target multi-dimensional user state score.

US Pat. No. 9,792,084

MACHINE-LED MOOD CHANGE

GRACENOTE, INC., Emeryvi...

1. A method comprising:
accessing, by one or more processors of a machine, play counts that quantify playbacks of media files played for a user, the
play counts being accessed from a playback history database of the user;

accessing, by one or more processors of the machine, arousal scores of the media files played for the user, the arousal scores
indicating emotional energy levels that are represented within a three-dimensional pleasure-arousal-dominance model of emotional
states and are humanly perceivable in the playbacks of the media files;

generating, by one or more processors of the machine, a distribution of the play counts over the arousal scores, the distribution
pairing play counts to corresponding arousal scores, the distribution including one or more relative maximum of play counts
that each correspond to a different arousal score among the arousal scores;

detecting a biometric measurement of the user via a biometric sensor;
determining, by one or more processors of the machine, a current arousal score of the user based on the biometric measurement
of the user;

selecting, by one or more processors of the machine, a target arousal score for the user based on contextual data that describes
an activity in which the user is engaged;

identifying, by one or more processors of the machine, a first relative maximum of the one or more relative maximum having
a first arousal score closest to the target arousal score and based on the current arousal score of the user; and

selecting, by one or more processors of the machine, a media file for playback to the user based on the media file having
an arousal score at or near the first arousal score of the first relative maximum.

US Pat. No. 10,275,212

AUDIO STREAMING BASED ON IN-AUTOMOBILE DETECTION

Gracenote Digital Venture...

1. An in-automobile audio system comprising:a speaker configured to produce audio output;
a communication interface configured to access a wide-area network;
a processor; and
memory containing program instructions that, when executed by the processor, cause the in-automobile audio system to:
transmit, by way of the communication interface and to a profile server device, a request for a playlist, wherein the playlist contains references to audio files, the references previously stored on the profile server device as part of a save-to-automobile operation;
receive, by way of the communication interface and from the profile server device, the playlist;
based on the references in the playlist, transmit, by way of the communication interface and to an audio server device, a request for streaming of one or more of the audio files;
receive, by way of the communication interface and from the audio server device, a stream of the requested audio files; and
play out, by way of the speaker, the requested audio files.

US Pat. No. 10,140,372

USER PROFILE BASED ON CLUSTERING TIERED DESCRIPTORS

Gracenote, Inc., Emeryvi...

1. A method comprising:accessing, by executing an instruction with a processor, descriptors in metadata that is descriptive of a first item and of a second item, the descriptors and metadata corresponding to a metadata model that organizes the descriptors into multiple tiers of the metadata model, the descriptors including a first descriptor of the first item and a second descriptor of the second item;
accessing, from a database communicatively coupled to the processor, the metadata model that organizes the descriptors into the multiple tiers;
creating, by executing an instruction with the processor, a group of descriptors by grouping the accessed first and second descriptors into the group of descriptors based on the accessed first and second descriptors being both represented in a same tier among the multiple tiers of the accessed metadata model that corresponds to the first and second descriptors;
accessing, via a device of a user communicatively coupled to the processor via a network, biometric data including a heart rate of the user;
determining, by executing an instruction with the processor, a first activity in which the user is engaged based on contextual data that correlates the first item and the second item with multiple locations of the user and the biometric data of the user received from the device of the user via the network;
generating, by executing an instruction with the processor, a user profile based on the first activity of the user and the created group of descriptors into which the first and second descriptors were grouped based on the first and second descriptors being both represented in the same tier among the multiple tiers of the accessed metadata model that corresponds to the first and second descriptors;
storing the group within the user profile as corresponding to the first activity determined based on the multiple locations and the biometric data of the user; and
recommending, by executing an instruction with the processor and in response to a second activity of the user matching the first activity associated with the group within the user profile, a third item based on the user profile, the user profile generated based on the created group of descriptors into which the first and second descriptors were grouped, the grouping performed based on the first and second descriptors being both represented in the same tier among the multiple tiers of the accessed metadata model.

US Pat. No. 9,813,751

MEDIA SOURCE IDENTIFICATION

Gracenote, Inc., Emeryvi...

1. A method comprising:
detecting, by one or more processors, a control signal communicated by a first device to a second device, the control signal
causing the second device to select a data stream from a plurality of data streams;

accessing, by one or more of the processors, an output signal resultant from the second device in processing the data stream;
generating, by one or more of the processors, a representation of the output signal;
obtaining, by one or more of the processors, an identifier of a source of the data stream based on a comparison of the generated
representation of the output signal to a reference representation that corresponds to the data stream; and

assigning, by one or more of the processors, the identifier of the source of the data stream to the control signal that caused
the second device to select the data stream.

US Pat. No. 9,712,853

METHOD AND DEVICE FOR GENERATING AND DETECTING A FINGERPRINT FUNCTIONING AS A TRIGGER MARKER IN A MULTIMEDIA SIGNAL

Gracenote, Inc., Emeryvi...

1. A method comprising:
receiving, by a signal receiver of a media playback device, a first media stream;
playing back, on a display associated with the media playback device, the first media stream;
determining, by a processor of the media playback device, a trigger fingerprint from a segment of the first media stream,
wherein the trigger fingerprint is a representation of features of the segment;

matching the trigger fingerprint to a reference fingerprint, wherein the reference fingerprint is stored in a database of
reference fingerprints and is associated with a reference action, and wherein the reference action:

is associated with a time point indicating when, in the first media stream, the reference action is to be performed; and
instructs the media playback device to replace playback of the first media stream with playback of a second media stream;
and

responsive to matching the trigger fingerprint to the reference fingerprint, playing back, on the display associated with
the media playback device, the second media stream in place of the first media stream.

US Pat. No. 10,142,680

DYNAMIC VIDEO OVERLAYS

Gracenote, Inc., Emeryvi...

1. A method comprising:causing, by a processor of a client device, a video output stream to be presented on a display of the client device; and
while causing the video output stream to be presented on the display:
accessing, by the processor, a video input stream comprising first video content from a content provider and second video content generated by a set-top box upstream of the processor, wherein, in a frame of the video input stream, the first video content corresponds to a first screen portion of the display and the second video content corresponds to a second screen portion of the display, wherein the video output stream comprises third video content from a replacement content source, wherein the third video content differs from the first video content and is accessed separately from the video input stream, wherein the third video content is selected in response to a video identification system receiving a query fingerprint from a query fingerprint generator of the client device and the video identification system identifying a match between the query fingerprint and at least one reference fingerprint generated by a reference fingerprint generator, wherein the query fingerprint is generated based on a frame of the first video content, and wherein the video identification system and the reference fingerprint generator are remote from the client device;
accessing, from a database, a template corresponding to the second screen portion of the display;
analyzing, by the processor, the video input stream to determine that at least a portion of the second video content of the frame of the video input stream matches the template;
based on the at least the portion of the second video content of the frame of the video input stream matching the template, determining, by the processor, that a speed of play of the video input stream has been modified from a default speed;
responsive to determining that the speed of play of the video input stream has been modified, modifying, by the processor, the video output stream by modifying a speed of play of the video output stream to generate a modified video output stream that mimics the speed of play of the video input stream; and
responsive to modifying the video output stream, causing, by the processor, the modified video output stream to be presented on the display.

US Pat. No. 10,063,918

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON SINGLE-MATCH

Gracenote, Inc., Emeryvi...

1. A method comprising:determining, by a computing system, that a digital fingerprint of a first segment of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
after determining that the digital fingerprint of the first segment matches the multiple reference fingerprints each corresponding with a different respective channel, performing by the computing system disambiguation based at least in part on a determination that a digital fingerprint of a second segment of the media content being rendered by the media presentation device matches a reference fingerprint corresponding with just a single channel, the disambiguation establishing that the media presentation device is receiving the media content on the single channel, wherein the first and second segments of the media content being rendered by the media presentation device are first and second respective time segments of the media content being rendered by the media presentation device, wherein the second time segment is before the first time segment; and
taking action by the computing system based on the establishing that the media presentation device is receiving the media content on the single channel.

US Pat. No. 10,290,298

REAL TIME POPULARITY BASED AUDIBLE CONTENT ACQUISITION

Gracenote Digital Venture...

1. A method comprising:analyzing, by a computing device, generated playlists for a plurality of client devices, wherein the generated playlists contain ordered references to text-to-speech-generated audio files of news stories respectively customized for the client devices;
determining, by the computing device, that a news story has a threshold extent of popularity based on a number of references to a text-to-speech-generated audio file of the news story that are included in the generated playlists;
responsive to determining that the news story has the threshold level of popularity, transmitting, by the computing device, to a remote news studio, a request for a human verbal reading of the news story;
receiving, by the computing device, from the remote news studio, an audio file of the human verbal reading;
updating, by the computing device, the references to the text-to-speech-generated audio file of the news story to refer to the audio file of the human verbal reading; and
transmitting, by the computing device, a playlist to a client device, wherein the playlist includes a plurality of ordered references to audio files of news stories, including the audio file of the human verbal reading, and wherein reception of the playlist by the client device triggers the client device to (i) traverse the playlist, (ii) based on the references, retrieve the audio files, including the audio file of the human verbal reading, and (iii) play out the audio files, including the audio file of the human verbal reading.

US Pat. No. 10,134,373

MACHINE-CONTROL OF A DEVICE BASED ON MACHINE-DETECTED TRANSITIONS

GRACENOTE, INC., Emeryvi...

1. A method comprising:accessing, by executing an instruction using one or more processors, a memory to retrieve audio data that represents audio received in an audio signal;
determining, by executing an instruction using the one or more processors, an audio characteristic in the audio, the audio characteristic being represented by an audio characteristic value;
detecting, by executing an instruction using the one or more processors, a transition between a music category and a talk category by comparing the audio characteristic value to a threshold value among a set of threshold values, the set of threshold values corresponding to different categories in a set of categories including the music category and the talk category;
modifying, by executing an instruction using the one or more processors, at least one threshold value among the set of threshold values based on the detecting of the transition between the music category and the talk category; and
controlling, by executing an instruction using the one or more processors, a device based on the detected transition between the music category and the talk category.

US Pat. No. 10,055,010

DETECTING AN EVENT WITHIN INTERACTIVE MEDIA INCLUDING SPATIALIZED MULTI-CHANNEL AUDIO CONTENT

Gracenote, Inc., Emeryvi...

1. A method comprising:generating, by one or more processors of a machine, a query fingerprint from a playback of spatialized media content that is spatialized in three dimensions and that indicates a virtual orientation at a virtual location in a virtual world;
accessing, by one or more processors of the machine, a database that stores reference fingerprints, a reference fingerprint among the reference fingerprints being generated from the spatialized media content, the reference fingerprint identifying information that corresponds to the virtual orientation at the virtual location in the virtual world;
querying, by one or more processors of the machine, the database with the query fingerprint to determine whether the query fingerprint matches any of the reference fingerprints;
selecting, by one or more processors of the machine, the information identified by the reference fingerprint based on the reference fingerprint matching the query fingerprint; and
causing, by one or more processors of the machine, presentation of the information that corresponds to the virtual orientation at the virtual location within the virtual world in response to the playback of the spatialized media content.

US Pat. No. 9,842,146

METHODS AND APPARATUS FOR DETERMINING A MOOD PROFILE ASSOCIATED WITH MEDIA DATA

Gracenote, Inc., Chicago...

1. A method comprising:
accessing, from a database, a first mood vector that describes first media data and specifies a first mood-value pair, the
first mood-value pair assigning a first value to a first mood;

accessing, from the database, a second mood vector that describes a second media data and specifies a second mood-value pair,
the second mood-value pair assigning a second value to a second mood;

retrieving a first score that quantifies congruency between the first and second moods, the first score being retrieved from
a data structure that correlates the first and second moods;

using a processor, generating a second score that quantifies congruency between the first and second mood vectors, the generating
the second score being based on the accessed first and second values and the retrieved first score; and

based on the generated second score, adding the first media data to a playlist.

US Pat. No. 9,736,549

METHOD AND APPARATUS FOR SELECTION OF CONTENT FROM A STREAM OF DATA

Gracenote, Inc., Emeryvi...

1. An apparatus comprising:
a processor configured to perform operations comprising:
receiving a main datastream and an item of mark-up information associated with a reference fingerprint;
computing main fingerprints from at least some main content elements included in the main datastream; and
linking a part of the main datastream to the item of mark-up information based on a comparison between the reference fingerprint
and one or more of the main fingerprints.

US Pat. No. 10,270,826

IN-AUTOMOBILE AUDIO SYSTEM PLAYOUT OF SAVED MEDIA

Gracenote Digital Venture...

1. An in-automobile audio system comprising:a speaker configured to produce audio output;
a first communication interface configured to access a local-area network;
a second communication interface configured to access a wide-area network;
a processor; and
memory containing program instructions that, when executed by the processor, cause the in-automobile audio system to:
transmit, by way of the first communication interface and to a client device, a request for a playlist, wherein the playlist contains references to audio files, the references previously stored on the client device using one or more save-to-automobile operations;
receive, by way of the first communication interface and from the client device, the playlist;
based on the references in the playlist, transmit, by way of the second communication interface and to an audio server device, a request for streaming of one or more of the audio files;
receive, by way of the second communication interface and from the audio server device, a stream of the requested audio files; and
play out, by way of the speaker, the requested audio files.

US Pat. No. 9,940,973

SYNTHESIZING A PRESENTATION OF A MULTIMEDIA EVENT

Gracenote, Inc., Chicago...

1. A system comprising:a processor-implemented media ingestion module configured to access a plurality of media clips including a first video clip with a first audio waveform and a second video clip with a second audio waveform;
a media analysis module configured to:
match a first acoustic fingerprint of at least a part of the first audio waveform with a second acoustic fingerprint of at least a part of the second audio waveform;
determine a temporal overlap of the first video clip with the second video clip based at least in part on the match; and
a content creation module configured to:
merge the first video clip and the second video clip into a group of overlapping video clips based on the temporal overlap relation; and
generate a presentation that includes the group formed by merging the first video clip and the second video clip based on the temporal overlap to generate the presentation.

US Pat. No. 10,536,746

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON LOCATION

Gracenote, Inc., Emeryvi...

1. A method comprising:detecting by a computing system that digital fingerprint data representing media content being rendered by a media presentation device matches multiple instances of reference fingerprint data each corresponding with a different respective channel;
responsive to the detecting, performing by the computing system disambiguation based at least in part on a location of the media presentation device, to determine which channel the media presentation device is receiving the media content on, wherein performing the disambiguation comprises, based at least in part on a match between the location of the media presentation device and a location associated with a particular one of the multiple instances of reference fingerprint data, determining that the respective channel with which the particular instance of reference fingerprint data corresponds is the channel that the media presentation device is receiving media content on; and
taking action based on the determined channel.

US Pat. No. 10,423,654

MEDIA FINGERPRINTING AND IDENTIFICATION SYSTEM

Gracenote, Inc., Emeryvi...

1. A computer-implemented method comprising:establishing, using a video fingerprinting system, a rectangular grid of blocks overlaying pixels of a frame of a reference video, wherein each block of the rectangular grid of blocks includes a grid of pixels, and wherein boundaries of the rectangular grid of blocks are defined such that some of the blocks of the rectangular grid of blocks include different numbers of pixels;
determining, for respective blocks of the rectangular grid of blocks, using the video fingerprinting system, respective intensity values corresponding to pixels included within the respective blocks of the rectangular grid of blocks;
determining, for respective blocks of the rectangular grid of blocks, using the video fingerprinting system, respective block values based on the respective intensity values;
generating, using the video fingerprinting system, fingerprint data for the frame based on the respective block values; and
storing, in a reference database, the fingerprint data in association with an identifier of the reference video.

US Pat. No. 10,387,482

MEDIA FINGERPRINTING AND IDENTIFICATION SYSTEM

Gracenote, Inc., Emeryvi...

1. A method comprising:establishing, using a computing device, a rectangular grid of sub-regions overlaying pixels of a frame of a query video;
determining, for respective sub-regions of the rectangular grid of sub-regions, using the computing device, respective pixel color data corresponding to pixels included within the respective sub-regions;
comparing, using the computing device, the respective pixel color data for the respective sub-regions with median color values for the rectangular grid of sub-regions;
determining, for the respective sub-regions, using the computing device, respective descriptor values based on the comparing;
generating, using the computing device, fingerprint data for the frame of the query video based on the respective descriptor values for the respective sub-regions;
providing, by the computing device, the fingerprint data to a server over a network, wherein the server has access to reference fingerprint data of a reference database, and wherein the reference fingerprint data corresponds to a plurality of reference videos and is derived using pixel color data for the plurality of reference videos; and
responsive to providing the fingerprint data to the server, receiving, by the computing device, an identifier of a reference video corresponding to the query video.

US Pat. No. 10,146,100

SYSTEMS AND METHODS TO TRANSFORM EVENTS AND/OR MOOD ASSOCIATED WITH PLAYING MEDIA INTO LIGHTING EFFECTS

Gracenote, Inc., Emeryvi...

1. An apparatus comprising:a content identifier to:
identify content presented via a media presentation device based on a fingerprint associated with the content; and
derive metadata from the identified content; and
a content driven analyzer to:
determine a light effect to be produced by a light-generating device based on the metadata;
generate an instruction for the light-generating device to produce the light effect; and
transmit the instruction to the light-generating device during presentation of the content.

US Pat. No. 10,045,054

METHOD AND DEVICE FOR GENERATING AND DETECTING A FINGERPRINT FUNCTIONING AS A TRIGGER MARKER IN A MULTIMEDIA SIGNAL

Gracenote, Inc., Emeryvi...

1. A non-transitory machine-readable storage medium having stored thereon, program instructions that when executed by a processor of a machine, cause the machine to perform operations comprising:playing multimedia content on a multimedia playback device;
during the playback of the multimedia content by the multimedia playback device, repeatedly deriving, by the multimedia playback device, fingerprints from respective segments of the multimedia content;
comparing the derived fingerprints to one or more reference fingerprints representing features of the multimedia content, wherein each reference fingerprint is associated with an action;
determining that one of the derived fingerprints matches one of the one or more reference fingerprints; and
in response to determining that the one of the derived fingerprints matches the one of the one or more reference fingerprints, causing execution of the action associated with the one of the one or more reference fingerprints,
wherein the action to be executed is associated with a time point indicating when, in the multimedia content, the action is to be executed.

US Pat. No. 10,136,183

DYNAMIC VIDEO OVERLAYS

Gracenote, Inc., Emeryvi...

1. A method for dynamically providing a video overlay, the method comprising:accessing, by one or more processors, a video input stream that includes first video content from a first content provider;
based on the first video content of the video input stream, selecting replacement video content for a video output stream, wherein the replacement video content of the video output stream is different from the first video content of the video input stream, wherein the replacement video content is selected in response to a video identification system receiving a query fingerprint from a query fingerprint generator of a client device and the video identification system identifying a match between the query fingerprint and at least one reference fingerprint generated by a reference fingerprint generator, wherein the query fingerprint is generated based on a frame of the first video content, and wherein the video identification system and the reference fingerprint generator are remote from the client device;
causing the replacement video content of the video output stream to be presented on a display of the client device;
while the display is presenting the replacement video content of the video output stream, determining, by the one or more processors, that a first change in the video input stream has occurred, wherein the first change indicates a change from the first content provider to a second content provider of the video input stream has occurred, wherein the video input stream includes second video content from the second content provider after the first change has occurred;
in response to determining the change from the first content provider to the second content provider has occurred, causing the second video content of the video input stream to be presented on the display instead of the replacement video content;
after causing the second video content of the video input stream to be presented on the display:
accessing, from a database, a template corresponding to a screen portion of the display for third video content generated by an intermediate device;
analyzing, by the one or more processors, the video input stream to determine that at least a portion of the third video content of a frame of the video input stream matches the template when the video input stream includes the first video content from the first content provider and the third video content generated by the intermediate device, wherein the third video content is configured to be presented on the display to indicate an identity of the first content provider;
based on the at least the portion of the third video content of the frame of the video input stream matching the template, determining that a second change in the video input stream indicates a return to the first content provider of the video input stream; and
in response to the determination that the first content provider of the video input stream has been returned to:
modifying the video output stream to include the replacement video content and the third video content, and
causing a presentation of the replacement video content and the third video content of the modified video output stream on the display, wherein the third video content is presented as the video overlay on the replacement video content.

US Pat. No. 9,986,306

METHOD AND APPARATUS FOR SELECTION OF CONTENT FROM A STREAM OF DATA

Gracenote, Inc., Emeryvi...

1. A method comprising:accessing, by a receiver of a media device, a media stream that includes a sequence of items of content;
accessing, by the media device, a reference stream that includes reference fingerprints derived from respective segments of at least some of the items of content;
computing, by a processor of the media device, a fingerprint from a particular segment of the sequence of items of content of the media stream;
comparing, by the processor of the media device, the computed fingerprint to one or more of the reference fingerprints;
determining, by the processor of the media device, based on the comparing, that the computed fingerprint corresponds to a particular one of the one or more reference fingerprints; and
associating the particular segment of the media stream from which the fingerprint was computed with a time point associated with the particular one of the one or more reference fingerprints.

US Pat. No. 9,769,143

CONTENT-BASED ASSOCIATION OF DEVICE TO USER

Gracenote, Inc., Emeryvi...

1. A method comprising:
accessing, by one or more hardware processors, metadata of content items accessible by a device on which applications are
executable, the content items including distinct and overlapping first and second portions of the content items;

generating, by the one or more hardware processors executing a first module, a first identifier based on first metadata that
describes the first portion of the content items, the first module corresponding to a first application whose access to the
content items is limited to the first portion;

providing the first identifier to a server machine configured to assign the first identifier to a user identifier;
generating, by the one or more hardware processors executing a second module, a second identifier based on second metadata
that describes the second portion of the content items, the second module corresponding to a second application whose access
to the content items is limited to the second portion; and

providing the second identifier to the server machine configured to:
determine that the first and second portions are overlapping portions of the content items, based on the provided first and
second identifiers; and

assign the user identifier to the second identifier based on the first and second portions being overlapping portions of the
content items.

US Pat. No. 10,210,859

REAL TIME POPULARITY BASED AUDIBLE CONTENT ACQUISITION

Gracenote Digital Venture...

1. A method comprising:analyzing, by a computing device, generated playlists for a plurality of client devices, wherein the generated playlists contain ordered references to text-to-speech-generated audio files of news stories respectively customized for the client devices;
determining, by the computing device, that a news story has a threshold extent of popularity based on a number of references to a text-to-speech-generated audio file of the news story that are included in the generated playlists;
responsive to determining that the news story has the threshold level of popularity, transmitting, by the computing device, to a remote news studio, a request for a human verbal reading of the news story;
receiving, by the computing device, from the remote news studio, an audio file of the human verbal reading;
updating, by the computing device, the references to the text-to-speech-generated audio file of the news story to refer to the audio file of the human verbal reading; and
transmitting, by the computing device, a playlist to a client device, wherein the playlist includes a plurality of ordered references to audio files of news stories, including the audio file of the human verbal reading, and wherein reception of the playlist by the client device triggers the client device to (i) traverse the playlist, (ii) based on the references, retrieve the audio files, including the audio file of the human verbal reading, and (iii) play out the audio files, including the audio file of the human verbal reading.

US Pat. No. 10,147,407

CHARACTERIZING AUDIO USING TRANSCHROMAGRAMS

Gracenote, Inc., Emeryvi...

1. A method, comprising:generating, by executing one or more instructions on a processor, a set of transition matrices based on a plurality of time frames of audio data, each of the plurality of transition matrices generated based on a different pair of time frames in the plurality of time frames, and indicating probabilities that anterior musical notes will transition to posterior musical notes, the anterior musical notes in an anterior time frame of the pair and the posterior musical notes in a posterior time frame of the pair;
generating, by executing one or more instructions on a processor, a data structure representing how the audio data changes statistically between the plurality of time frames based on the set of transition matrices;
causing, by executing one or more instructions on a processor, a database to store the data structure within metadata that describes the audio data;
identifying, by executing one or more instructions on a processor, at least one of query audio data, a musical key of the query audio data, a musical chord of the query audio data, a song structure of the query audio data, and a musical genre of the query audio data; and
presenting, by executing one or more instructions on a processor, a notification, the notification indicating that the at least one of the query audio data, the musical key of the query audio data, the musical chord of the query audio data, the song structure of the query audio data, and the musical genre of the query audio data is identified.

US Pat. No. 9,794,642

INSERTING ADVERTISEMENTS INTO VIDEO CONTENT

Gracenote, Inc., Emeryvi...

1. A method, comprising:
receiving a request to replace one or more frames of video content currently playing at a client device with supplemental
video content, the one or more frames of video content being at least one frame fewer than the supplemental video content;

determining one or more fingerprints of the video content currently playing at the client device;
by a processor, identifying the one or more frames of the video content to be replaced with the supplemental video content
based on the determined one or more fingerprints; and

replacing the identified one or more frames of the video content and at least one frame adjacent to the identified one or
more frames with the supplemental video content without shifting the at least one frame adjacent to the identified one or
more frames.

US Pat. No. 10,372,411

AUDIO STREAMING BASED ON IN-AUTOMOBILE DETECTION

Gracenote Digital Venture...

1. A computer-implemented method comprising:receiving, by a profile server device and from a client device, references to one or more audio files, wherein the references to the one or more audio files are provided as part of a save to automobile operation on the client device;
storing, by the profile server device, the references to the one or more audio files in a profile of a user, and wherein the profile is associated with the client device;
receiving, by the profile server device and from an automobile, a request for a playlist associated with the profile, wherein the user is associated with the automobile;
generating, by the profile server device, a playlist containing the references to the one or more audio files that were provided as part of the save to automobile operation; and
transmitting, by the profile server device and to the automobile, the playlist.

US Pat. No. 10,045,074

METHOD AND SYSTEM FOR DETECTING AND RESPONDING TO CHANGING OF MEDIA CHANNEL

Gracenote, Inc., Emeryvi...

1. A method of detecting and responding to a change of channels at a media presentation device, the method comprising:while the media presentation device is presenting media content determined to be of a first channel, detecting a threshold mismatch between (i) a query fingerprint stream representing the media content being presented by the media presentation device and (ii) a reference fingerprint stream representing the first channel, wherein determining that the query fingerprint stream represents the first channel is done using a first fingerprint comparison time interval, wherein detecting the threshold mismatch is done using a second fingerprint comparison time interval shorter than the first fingerprint comparison time interval, and wherein the threshold mismatch indicates a likelihood that the media presentation device has transitioned from presenting the first channel to presenting a second channel;
responsive to detecting the threshold mismatch, changing an overlay on the presented media content, wherein changing the overlay involves removing superimposed content associated with the first channel.

US Pat. No. 9,891,796

USER INTERFACE TO MEDIA FILES

Gracenote, Inc., Emeryvi...

1. A method comprising:
accessing a media file that stores acoustic data;
determining a mood of the media file based on an analysis of the acoustic data of the media file, the media file being characterized
as indicative of the determined mood, the determining of the mood causing incrementation of a count of media files characterized
as indicative of the determined mood within a predetermined period of time;

generating a user interface that includes zones among which is a zone whose position in the user interface corresponds to
the determined mood, the generating of the user interface including coloring the zone based on the incremented count of media
files characterized as indicative of the determined mood within the predetermined period of time, the generating of the user
interface being performed by a processor of a machine; and

presenting the user interface that includes the zone whose position corresponds to the mood of the media file, the zone being
colored based on the incremented count of media files characterized as indicative of the determined mood within the predetermined
time.

US Pat. No. 9,794,644

METHOD AND APPARATUS FOR SELECTION OF CONTENT FROM A STREAM OF DATA

Gracenote, Inc., Chicago...

1. A method comprising:
accessing, by a receiver of a media device, a media stream that includes a sequence of items of content;
accessing, by the media device, a reference stream that includes reference fingerprints derived from respective segments of
at least some of the items of content;

computing, by a processor of the media device, a fingerprint from a particular segment of the sequence of items of content
of the media stream;

comparing, by the processor of the media device, the computed fingerprint to one or more of the reference fingerprints;
determining, by the processor of the media device, based on the comparing, that the computed fingerprint corresponds to a
particular one of the one or more reference fingerprints;

identifying a time point associated with the particular one of the one or more reference fingerprints; and
associating the particular segment of the media stream from which the fingerprint was computed with the identified time point.

US Pat. No. 9,785,708

SCALABLE, ADAPTABLE, AND MANAGEABLE SYSTEM FOR MULTIMEDIA IDENTIFICATION

GRACENOTE, INC., Emeryvi...

1. A method of tracking multimedia content, the method comprising:
searching in a remote database in response to a query sent from a user device for multimedia content of a multimedia program
that is currently playing on the user device;

finding matching signatures for the multimedia content in a reference program stored in the remote database;
retrieving the matching signatures and reference signatures for succeeding segments of the reference program beginning from
the point in the reference program the multimedia content was found in the remote database;

storing the matching signatures and reference signatures in a local database on the user device;
tracking the multimedia program in the local database to find matching content for the succeeding segments of the reference
program;

determining the multimedia content has changed to different multimedia content than that stored in the succeeding segments
of the reference program; and

replacing the determined different multimedia content to a preselected multimedia content.

US Pat. No. 10,402,476

COMPUTING DEVICE AND CORRESPONDING METHOD FOR GENERATING DATA REPRESENTING TEXT

Gracenote Digital Venture...

1. A method comprising:accessing first data defining multiple portions of a content item, wherein at least a plurality of the portions represent text;
selecting, from the plurality of portions representing text, a subset of the portions representing text, wherein the selecting is based on each portion of the selected subset having a particular characteristic;
based on the text represented by the portions of the selected subset, generating second data that represents a concatenation of the text represented by the portions of the selected subset; and
providing output based on the generated second data.

US Pat. No. 10,345,892

DETECTING AND RESPONDING TO AN EVENT WITHIN AN INTERACTIVE VIDEOGAME

Gracenote, Inc., Emeryvi...

1. A non-transitory machine-readable medium storing instructions executable by a processor to cause a machine to carry out operations including:detecting that an identifier established from presentation by a device of an interactive videogame matches a reference identifier corresponding with an event within the interactive videogame, wherein the established identifier is selected from the group consisting of a fingerprint of the interactive videogame and a watermark from the interactive videogame; and
responsive to at least detecting that the established identifier matches the reference identifier corresponding with the event within the interactive videogame, causing presentation of a notification that references the occurrence of the event.

US Pat. No. 9,992,518

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICES

Gracenote, Inc., Emeryvi...

1. A method comprising:receiving at a media system, a first sequence of media content from a media content distributor over a first communication network, the first sequence of media content including steganographic data;
detecting, by the media system, the steganographic data in the first sequence of media content, wherein the steganographic data comprises identifying information relating to at least one of an advertiser, an advertisement campaign, or a unique identifier of a replaceable advertisement of the first sequence of media content;
responsive to detecting the steganographic data, transmitting, to a content replacement system over a second communication network, a request for a second sequence of media content;
receiving, by the media system, the second sequence of media content;
causing presentation of a portion of the first sequence of media content on a display of the media system;
causing presentation, on the display, of the second sequence of media content instead of the replaceable advertisement of the first sequence of media content; and
responsive to causing the presentation of the second sequence of media content and without user intervention, transmitting, by the media system, the identifying information for the replaceable advertisement to a viewer statistics reporting service to report that the replaceable advertisement was replaced by the media system.

US Pat. No. 9,905,267

COMPUTING SYSTEM WITH DVE TEMPLATE SELECTION AND VIDEO CONTENT ITEM GENERATION FEATURE

Gracenote, Inc., Emeryvi...

1. A method comprising:
receiving, by a computing system, a first group of video content items;
identifying, by the computing system, from among the first group of video content items, a second group of video content items
having a threshold extent of similarity with each other;

determining, by the computing system, a quality score for each video content item of the second group;
identifying, by the computing system, from among the second group of video content items, a third group of video content items
each having a quality score that exceeds a quality score threshold; and

based on the identifying of the third group, transmitting, by the computing system, at least a portion of at least one video
content item of the identified third group to a digital video-effect (DVE) system, wherein the DVE system is configured for
(i) receiving the transmitted at least the portion of the at least one video content item of the identified third group, and
(ii) using (a) the received at least the portion of the at least one video content item of the identified third group and
(b) a DVE template, to generate a video content item.

US Pat. No. 9,753,925

SYSTEMS, METHODS, AND APPARATUS FOR GENERATING AN AUDIO-VISUAL PRESENTATION USING CHARACTERISTICS OF AUDIO, VISUAL AND SYMBOLIC MEDIA OBJECTS

Gracenote, Inc., Emeryvi...

1. A method of creating an audio-visual presentation, the method comprising:
identifying a first plurality of media object characteristics associated with a first media object to be presented by a media
player device, the first media object including audio data, and the first plurality of media object characteristics including
a mood characteristic of the audio data;

determining a congruency between the first plurality of media object characteristics and a second plurality of media object
characteristics associated with a second media object that includes video data, the determining of the congruency including
accessing a score representative of a degree of congruency between one of the first plurality of media object characteristics
and one of the second plurality of media object characteristics, the score being accessed from a correlation matrix, the determining
of the congruency being based on the score and a threshold value;

determining a plurality of temporal segments of the audio data based on the mood characteristic;
determining a duration of a temporal segment of the plurality of temporal segments of the audio data;
identifying the second media object from a plurality of reference media objects based on the congruency between the first
plurality of media object characteristics and the second plurality of media object characteristics, the identifying of the
second media object being further based on the duration of the temporal segment of the audio data; and

using a processor, generating the audio-visual presentation to synchronously present the first media object and the second
media object on the media player device.

US Pat. No. 10,297,286

SYSTEM AND METHODS TO ASSOCIATE MULTIMEDIA TAGS WITH USER COMMENTS AND GENERATE USER MODIFIABLE SNIPPETS AROUND A TAG TIME FOR EFFICIENT STORAGE AND SHARING OF TAGGED ITEMS

Gracenote, Inc., Emeryvi...

1. An apparatus to add a tagged snippet of multimedia content to a playlist, the apparatus comprising:an automatic content recognition search service to search a fingerprint database to find a match between query fingerprints for a snippet of original multimedia content captured from a multimedia program at a timestamp and reference fingerprints of matching reference multimedia content stored in the fingerprint database;
a tag service to:
generate a tag representing the snippet of original multimedia content, wherein the tag, the timestamp, meta information associated with the matching reference multimedia content, and a monitored variable for a number of viewers of the snippet of original multimedia content are stored in a database storage as a tagged snippet of multimedia content; and
add the tagged snippet of multimedia content to a playlist for an identified multimedia program if the number of viewers of the tagged snippet exceeds a threshold.

US Pat. No. 10,635,701

MODEL-BASED MEDIA CLASSIFICATION SERVICE USING SENSED MEDIA NOISE CHARACTERISTICS

GRACENOTE, INC., Emeryvi...

1. A method for classifying media, the method comprising:accessing, using one or more processor circuits associated with a first device, first digital media data that represents a first media query to be identified;
determining, using the one or more processor circuits associated with the first device, a first noise characteristic corresponding to the first digital media data;
determining whether a prior classification attempt for a second media query was successful, the second media query corresponding to second digital media data having a similar noise characteristic to the first noise characteristic;
in response to determining the prior classification attempt was successful, transmitting the first digital media data to a remote media classification circuit that is configured to identify a source characteristic of the first digital media data; and
in response to determining the prior classification attempt unsuccessful, inhibiting the transmission of the first digital media data to the remote media classification circuit.

US Pat. No. 10,599,705

RETRIEVING AND PLAYING OUT MEDIA CONTENT FOR A PERSONALIZED PLAYLIST INCLUDING A CONTENT PLACEHOLDER

Gracenote Digital Venture...

1. A method comprising:receiving, by a client device, a playlist defining a sequence of media content items including a first media content item and a second media content item, and further defining a placeholder in between the first and second media content items;
retrieving, by the client device, from one or more server devices, first data representing the media content items of the received playlist, and using the retrieved first data to play out the media content items; and
while playing out at least one of the media content items represented by the retrieved first data:
making, by the client device, a first determination of whether the playlist defines a placeholder between the first and second media content items;
responsive to the first determination being that the playlist defines a placeholder between the first and second content items, making, by the client device, a second determination of whether a condition has been satisfied, wherein the condition comprises a condition that a content category type has a particular relationship with at least one of the media content items that has been played out, and wherein the condition further comprises a condition that a particular time is within a particular predetermined time period range and that the content category type is associated with the particular predetermined time period range; and
responsive to the second determination being that the condition has been satisfied, (i) retrieving, by the client device, from the one or more server devices, second data representing a media content item that has the content category type and that is not specified in the accessed playlist, and (ii) using, by the client device, the retrieved second data to play out the media content item of the content category type in between playing out the first and second media content items.

US Pat. No. 10,565,980

AUDIO STREAMING OF TEXT-BASED ARTICLES FROM NEWSFEEDS

Gracenote Digital Venture...

1. A method comprising:receiving, by a server device from a client device, a representation of a plurality of uniform resource locators (URLs) of text-based articles, wherein the text-based articles were selected by the client device from two or more markup language files provided by two or more newsfeed servers;
in response to receiving the representation, retrieving, by the server device, the text-based articles and respective publication dates of each from the two or more newsfeed servers;
causing, by the server device, the text-based articles to be converted into audio files;
receiving, by the server device, a request to stream the audio files to the client device;
ordering, by the server device, the text-based articles chronologically according to the respective publication dates, wherein ordering the text based articles chronologically according to the respective publication dates comprises generating a playlist with references to the URLs ordered chronologically according to the respective publication dates; and
causing, by the server device, the audio files to be streamed to the client device as chronologically ordered, wherein causing the audio files to be streamed to the client device as chronologically ordered comprises transmitting the playlist to the client device.

US Pat. No. 10,546,016

AUDIOVISUAL CONTENT CURATION SYSTEM

Gracenote, Inc., Emeryvi...

1. A method, comprising:receiving, at a server computer, a request for a station library, the request including criteria for the station library, the criteria including at least a first genre;
filtering, by the server computer, at least one media content catalog based on the criteria for the station library to generate a first list of candidate tracks for the station library;
determining, by the server computer, a first total number of tracks to be selected for the first genre based on the criteria;
generating, by the server computer, first similarity scores for respective ones of the tracks of the first list of candidate tracks in a first step, the first step including generating the first similarity scores determining correlation values between a primary genre for respective ones of the tracks in the first list of candidate tracks and the first genre, wherein the correlation values are compared to a specified threshold;
determining, by the server computer, first popularity scores for respective ones of the tracks of the first list of candidate tracks;
combining, by the server computer, the first similarity score and the first popularity score associated with each track of the first list of candidate tracks to generate first total scores for respective ones of the tracks of the first list of candidate tracks;
selecting, by the server computer, first top ranked tracks of the first list of candidate tracks based on the first total scores associated with respective ones of the tracks of the first list of candidate tracks and the first total number of tracks to be selected for the first genre to generate a list of top ranked tracks for the first genre; and
returning, by the server computer, the first list of top ranked tracks of the first genre as part of the station library.

US Pat. No. 10,341,734

METHOD AND SYSTEM FOR PRESENTING ADDITIONAL CONTENT AT A MEDIA SYSTEM

Gracenote, Inc., Emeryvi...

1. A method comprising:receiving, by a media system, a first sequence of media content;
receiving, by the media system, a subset of reference fingerprints selected from a plurality of reference fingerprints based on the reference fingerprints of the subset of reference fingerprints being associated with a subset of channels of a plurality of channels that the media system is used to watch more frequently than other channels of the plurality of channels;
generating, by the media system, a comparison fingerprint using first media content within the first sequence of media content;
determining, by the media system, that the comparison fingerprint does not match any reference fingerprints of the subset of reference fingerprints;
sending, by the media system, to a server system, a request for additional media content that includes the comparison fingerprint, wherein, based on the determining, the media system includes the comparison fingerprint in the request for comparison with additional reference fingerprints at the server system;
receiving, by the media system, a response to the request that includes information enabling the media system to replace or supplement the first media content with second media content, wherein the information comprises data indicative of an insertion point for the second media content;
determining, by the media system, a frame within the first sequence of media content at which to present the second media content based on the data indicative of the insertion point; and
providing, by the media system, for display a second sequence of media content that includes a portion of the first sequence of media content that occurs prior to the frame and includes the second media content.

US Pat. No. 9,998,767

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICES

Gracenote, Inc., Emeryvi...

1. A method comprising:accessing, at a media system, a sequence of media content;
sending, by a processor of the media system, a request for additional content to a remote server system via a network;
receiving a response to the request via the network, the response including the additional content; and
causing presentation of a displayed sequence of media content that includes at least a portion of the accessed sequence of media content and at least a portion of the additional content, the presenting of the displayed sequence including:
interrupting the at least the portion of the additional content by causing presentation of a further sequence of media content in response to a first request received during the presentation of the at least the portion of the additional content, the further sequence of media content being presented without presenting the additional content;
if a second request is received within a predetermined time of the first request, resuming presentation of the additional content in response to the second request; and
if the second request is received outside the predetermined time of the first request, presenting the accessed sequence of media content responsive to the second request,
wherein the predetermined time is related to a duration of the additional content.

US Pat. No. 9,966,112

SYSTEMS AND METHODS TO ASSOCIATE MULTIMEDIA TAGS WITH USER COMMENTS AND GENERATE USER MODIFIABLE SNIPPETS AROUND A TAG TIME FOR EFFICIENT STORAGE AND SHARING OF TAGGED ITEMS

Gracenote, Inc., Emeryvi...

1. A method to add a tagged snippet of multimedia content to a playlist based on a number of viewers, the method comprising:searching a fingerprint database to find a match between query fingerprints for a snippet of multimedia content captured from a multimedia program at a timestamp and original fingerprints of matching original multimedia content stored in the fingerprint database that identifies the snippet of multimedia content and the multimedia program;
generating a tag representing the snippet of multimedia content, wherein the tag, the timestamp, meta information associated with the matching original multimedia content, and a monitored variable for a number of viewers of the snippet of multimedia content are stored in a database storage as a tagged snippet of multimedia content; and
adding the tagged snippet of multimedia content to a playlist for the identified multimedia program if the number of viewers of the tagged snippet exceeds a threshold.

US Pat. No. 9,959,345

SEARCH AND IDENTIFICATION OF VIDEO CONTENT

Gracenote, Inc., Emeryvi...

1. A method, comprising:receiving a search query that includes a group of query fingerprints, each fingerprint of the group of query fingerprints being associated with a frame of video content presented at a client device, the frames of video content associated with the group of query fingerprints being sequential;
accessing, by a processor of a machine, an index of reference fingerprints, each reference fingerprint of the reference fingerprints corresponding to a sequential frame of known video content, each reference fingerprint of the reference fingerprints having an associated first value calculated from one or more first regions of the corresponding frame of known video content and an associated second value calculated from one or more second regions of the corresponding frame of known video content;
identifying two or more match candidates based on a comparison of the group of query fingerprints to the first values associated with the reference fingerprints;
selecting one or more of the identified match candidates based on a comparison of the group of query fingerprints to the second values associated with the identified match candidates; and
identifying the video content presented at the client device based on the selected match candidates.

US Pat. No. 10,715,840

ESTABLISHMENT AND USE OF TIME MAPPING BASED ON INTERPOLATION USING LOW-RATE FINGERPRINTING, TO HELP FACILITATE FRAME-ACCURATE CONTENT REVISION

Gracenote, Inc., Emeryvi...

1. A method comprising:ascertaining a plurality of matching points between (i) query fingerprints representing a media stream being received by a client and (ii) reference fingerprints, wherein each identified matching point defines a respective match between (i) a query fingerprint that is timestamped with client time defined according to a clock of the client and (ii) a reference fingerprint that is timestamped with true time according to a timeline within a known media stream;
performing linear regression based on the timestamps of the ascertained plurality of matching points, the linear regression establishing a mapping between true time and client time;
using the established mapping as a basis to determine a client-time point at which the client should perform an action with respect to media stream being received by the client; and
performing, at the determined client-time point, the action with respect to the media stream being received by the client,
wherein each matching point is based on a comparison between (i) a respective bundle of the query fingerprints and (ii) the reference fingerprints, wherein a given bundle of the query fingerprints is consecutive query fingerprints, and wherein the method further comprises detecting that the consecutive query fingerprints of the given bundle of the query fingerprints match each other and, responsive to the detecting that the consecutive query fingerprints of the given bundle match each other, excluding the given bundle from use to establish a matching point for the linear regression.

US Pat. No. 10,339,219

COMPUTING DEVICE AND CORRESPONDING METHOD FOR GENERATING DATA REPRESENTING TEXT

Gracenote Digital Venture...

1. A method comprising:accessing first data representing text, wherein the text defines at least two positions representing a particular type of grammatical break between two portions of the text, and further wherein the grammatical break is at least one of a paragraph break or a sentence break;
identifying, from among the at least two positions, a position that is closest to a target position within the text;
based on the identified position within the text, generating second data that represents a proper subset of the text, wherein the proper subset extends from an initial position within the text to the identified position within the text; and
providing output based on the generated second data.

US Pat. No. 10,210,252

METHOD AND APPARATUS FOR MULTI-DIMENSIONAL CONTENT SEARCH AND VIDEO IDENTIFICATION

Gracenote, Inc., Emeryvi...

1. A method of generating a likelihood score of a query video sequence matching an original video sequence, the method comprising:generating a similarity measure between frames of a query video sequence and an original video sequence based on frame similarity scores that exceed a threshold, wherein the original video is an entry in a video database;
generating a time correlation between features extracted from frames having different frame numbers in the query video sequence and in the original video sequence;
generating a correlation score between the original video sequence and the query video sequence by using a combination of the similarity measure and the time correlation to identify the likelihood score of the query video sequence matching the original video sequence; and
transmitting the query video sequence matching the original video sequence to a database of traversal indexes based on correlation scores that exceed a threshold.

US Pat. No. 9,924,222

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON LOCATION

Gracenote, Inc., Emeryvi...

1. A method comprising:
determining, by a computing system, that a digital fingerprint of media content being rendered by a media presentation device
matches multiple reference fingerprints each corresponding with a different respective channel;

responsive to determining that the digital fingerprint matches the multiple reference fingerprints each corresponding with
a different respective channel, performing by the computing system disambiguation based at least in part on location of the
media presentation device, to determine which channel the media presentation device is receiving the media content on, wherein
performing disambiguation based at least in part on the location of the media presentation device, to determine which channel
the media presentation device is receiving the media content on, comprises (i) determining the location of the media presentation
device, (ii) comparing the determined location of the media presentation device with locations indicated for the multiple
reference fingerprints to find that the location of the media presentation device matches a location indicated for a particular
reference fingerprint of the multiple reference fingerprints, and (iii) establishing based at least on the finding that the
location of the media presentation device matches the location indicated for the particular reference fingerprint of the multiple
reference fingerprints, that a channel associated with the particular reference fingerprint is the channel that the media
presentation device is receiving the media content on; and

taking action by the computing system based on the determined channel.

US Pat. No. 9,706,233

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICES

Gracenote, Inc., Emeryvi...

1. A method comprising:
accessing, by one or more processors of a media system operatively connected to a display, a sequence of media content, wherein
the sequence of media content is presented on the display of the media system;

accessing a fingerprint of a first portion of the sequence of media content;
comparing the fingerprint with a plurality of reference fingerprints stored in a database;
based on the comparison, identifying the sequence of media content and determining a position of the first portion within
the sequence of media content;

accessing replacement media content at the media system, wherein the replacement media content is selected based at least
in part on the identified sequence of media content;

determining, by the one or more processors, an offset position in the sequence of media content relative to the position of
the first portion, the offset position corresponding to a second portion of the sequence of media content; and

causing the replacement media content to be presented on the display of the media system beginning at the determined offset
position.

US Pat. No. 10,714,145

SYSTEMS AND METHODS TO ASSOCIATE MULTIMEDIA TAGS WITH USER COMMENTS AND GENERATE USER MODIFIABLE SNIPPETS AROUND A TAG TIME FOR EFFICIENT STORAGE AND SHARING OF TAGGED ITEMS

Gracenote, Inc., Emeryvi...

1. A non-transitory machine-readable storage medium having stored thereon, program instructions that when executed by a processor of a machine, cause the machine to perform operations comprising:capturing a snippet of multimedia content from a multimedia program at a timestamp;
generating a tag representing the snippet of multimedia content, wherein the tag, the timestamp, meta information associated with matching original multimedia content, and a monitored variable for a number of viewers of the snippet of multimedia content are stored in a snippet database as a tagged snippet of multimedia content;
performing a comparison the tagged snippet of multimedia content at a selected insertion point in a playlist of tags with other snippets in the playlist of tags if the number of viewers of the tagged snippet exceeds a threshold to automatically determine whether there is an overlap between the tagged snippet and the other snippets;
determining, based on the comparison, that there is the overlap between the tagged snippet and the other snippets; and
responsive to determining that there is the overlap, automatically removing duplicate content from the tagged snippet determined to have the overlap between the tagged snippet and the other snippets to create a modified snippet with an adjusted size without the overlap for insertion in the playlist of tags.

US Pat. No. 10,555,052

METHOD AND APPARATUS FOR SELECTION OF CONTENT FROM A STREAM OF DATA

Gracenote, Inc., Emeryvi...

1. A method comprising:accessing, by a receiver of a media device, a media stream that includes a sequence of items of content;
accessing, by the media device, a reference stream defining a sequence of reference fingerprints corresponding to respective segments of at least some of the items of content;
computing, by a processor of the media device, a computed fingerprint from a particular segment of the sequence of items of content of the media stream;
comparing, by the processor of the media device, the computed fingerprint to one or more of the reference fingerprints; and
selecting, by the processor of the media device and based on the comparing, one or more of the items of content from the media stream.

US Pat. No. 10,448,096

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON SINGLE-MATCH

Gracenote, Inc., Emeryvi...

1. A non-transitory computer readable medium having stored thereon instructions executable by a processing unit to carry out operations comprising:determining that a digital fingerprint of a first segment of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
after determining that the digital fingerprint of the first segment matches the multiple reference fingerprints each corresponding with a different respective channel, performing disambiguation based at least in part on an earlier determination that a digital fingerprint of a second segment of the media content being rendered by the media presentation device matches a reference fingerprint corresponding with just a single channel, the disambiguation establishing that the media presentation device is receiving the media content on the single channel, wherein the first and second segments of the media content being rendered by the media presentation device are first and second respective time segments of the media content being rendered by the media presentation device, wherein the second time segment is before the first time segment, and
taking action based on the establishing that the media presentation device is receiving the media content on the single channel.

US Pat. No. 10,393,530

DYNAMIC CONTENT DELIVERY BASED ON VEHICLE NAVIGATIONAL ATTRIBUTES

Gracenote, Inc., Emeryvi...

1. A method comprising:determining, by a computing device in a vehicle, a relevancy score for respective ones of a plurality of sporting event data items based on navigational attributes of the vehicle, the plurality of sporting event data items obtained from a server computer by the computing device, the navigational attributes including at least one of a location of the vehicle, vehicle sensor data, or vehicle navigational data;
determining, by the computing device in the vehicle, whether the navigational attributes correspond to a driving condition; and
when the navigational attributes correspond to a high-maintenance driving condition, presenting, by the computing device in the vehicle, a first sporting event data item of the plurality of sporting event data items when a first relevancy score of the first sporting event data item corresponds to the high-maintenance driving condition.

US Pat. No. 10,349,130

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON SINGLE-MATCH

Gracenote, Inc., Emeryvi...

1. A non-transitory computer readable medium having stored thereon instructions executable by a processing unit to carry out operations comprising:determining that a digital fingerprint of a first segment of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
after determining that the digital fingerprint of the first segment matches the multiple reference fingerprints each corresponding with a different respective channel, performing disambiguation based at least in part on an earlier determination that a digital fingerprint of a second segment of the media content being rendered by the media presentation device matches a reference fingerprint corresponding with just a single channel, the disambiguation establishing that the media presentation device is receiving the media content on the single channel, wherein the first and second segments of the media content being rendered by the media presentation device are first and second respective time segments of the media content being rendered by the media presentation device, wherein the second time segment is before the first time segment, and
taking action based on the establishing that the media presentation device is receiving the media content on the single channel.

US Pat. No. 10,282,471

AUDIO MATCHING BASED ON HARMONOGRAM

Gracenote, Inc., Emeryvi...

1. A method comprising:accessing, using one or more processors, audio data that represents query sound to be identified;
creating, using the one or more processors, a spectrogram of the audio data, different segments of the spectrogram representing amplitudes at frequencies in different time slices of the query sound;
determining, using the one or more processors, a dominant frequency in a time slice of the query sound based on a segment of the spectrogram, the determining including:
calculating an aggregate energy value of a candidate frequency based on amplitudes of the candidate frequency and harmonics thereof represented in the segment of the spectrogram; and
identifying the candidate frequency as the dominant frequency based on the aggregate energy value of the candidate frequency being a largest aggregate energy value among aggregate energy values of frequencies whose amplitudes are represented in the segment of the spectrogram;
creating, using the one or more processors, a query harmonogram of the audio data, different segments of the query harmonogram representing aggregate energy values of dominant frequencies in different time slices of the query sound; and
providing, using the one or more processors, an identifier of the query sound based on a comparison of the query harmonogram to a reference harmonogram mapped to the identifier by a database.

US Pat. No. 10,123,073

DYNAMIC VIDEO OVERLAYS

Gracenote, Inc., Emeryvi...

8. A system comprising: a memory that stores instructions; and one or more processors, of a client device, configured by the instructions to perform operations comprising:causing a video output stream to be presented on a display of the client device;
while causing the video output stream to be presented on the display:
accessing a video input stream comprising first video content from a content provider and second video content generated by a set-top box upstream of the one or more processors, wherein, in a frame of the video input stream, the first video content corresponds to a first screen portion of the display and the second video content corresponds to a second screen portion of the display, wherein the video output stream comprises third video content from a replacement content source, wherein the third video content differs from the first video content and is accessed separately from the video input stream, wherein the third video content is selected in response to a video identification system receiving a query fingerprint from a query fingerprint generator of the client device and the video identification system identifying a match between the query fingerprint and at least one reference fingerprint generated by a reference fingerprint generator, wherein the query fingerprint is generated based on a frame of the first video content, and wherein the video identification system and the reference fingerprint generator are remote from the client device;
accessing, from a database that stores a plurality of sets of templates, a template corresponding to the second screen portion of the display, wherein the database maps a plurality of set-top box identifiers to the plurality of sets of templates, wherein the template corresponding to the second screen portion of the display is accessed from a set of templates based on an identifier of the set-top box;
analyzing the video input stream to determine that at least a portion of the second video content of the frame of the video input stream matches the template;
responsive to determining that the at least the portion of the second video content of the frame of the video input stream matches the template, modifying the video output stream to include, in the second screen portion of the display, the second video content generated by the set-top box; and causing the modified video output stream, including the second video content in the second screen portion and the third video content in the first screen portion, to be presented on the display.

US Pat. No. 10,110,970

INSERTING ADVERTISEMENTS INTO VIDEO CONTENT

Gracenote, Inc., Emeryvi...

1. A method, comprising:determining one or more fingerprints of video content currently playing at a client device;
by a processor, identifying one or more frames of the video content to be replaced with supplemental video content based on the determined one or more fingerprints, wherein the one or more frames of video content are at least one frame fewer than the supplemental video content; and
replacing the identified one or more frames of the video content and at least one frame adjacent to the identified one or more frames with the supplemental video content without shifting the at least one frame adjacent to the identified one or more frames,
wherein the identifying of the one or more frames of the video content to be replaced with the supplemental video content based on the determined one or more fingerprints includes identifying a frame of the video content that is associated with a transition frame inserted into the video content.

US Pat. No. 9,992,533

MEDIA CHANNEL IDENTIFICATION AND ACTION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON MATCHING WITH DIFFERENTIAL REFERENCE—FINGERPRINT FEATURE

Gracenote, Inc., Emeryvi...

1. A method comprising:determining, by a computing system, that a query fingerprint of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel than each other;
responsive to at least determining that the query fingerprint matches the multiple reference fingerprints each corresponding with a different respective channel, performing disambiguation by (i) identifying a fingerprint feature that differs as between the multiple reference fingerprints and (ii) determining that the query fingerprint matches just one of the reference fingerprints as to the identified fingerprint feature, whereby the disambiguation establishes that the media presentation device is receiving the media content on the channel corresponding with the one reference fingerprint; and
taking action based on the establishing that the media presentation device is receiving the media content on the channel corresponding with the one reference fingerprint,
wherein the computing system includes or has access to channel-data that indicates for the channels corresponding with the multiple reference fingerprints a region of media content that is channel specific, and wherein identifying the fingerprint feature that differs as between the multiple reference fingerprints comprises (i) referring to the data to determine the region of media content that is channel specific and (ii) identifying as the fingerprint feature that differs as between the multiple reference fingerprints a fingerprint feature corresponding with the determined region of media content, and
wherein the media content comprises video content, and wherein the region of media content that is channel specific comprises a video frame region that is at least one of (i) at a video frame edge or (ii) where channel identification is presented.

US Pat. No. 10,503,956

IDENTIFYING VIDEO CONTENT VIA COLOR-BASED FINGERPRINT MATCHING

Gracenote, Inc., Emeryvi...

1. A method comprising:processing unknown video content to obtain a first fingerprint and a second fingerprint, the first fingerprint being a color-based fingerprint that includes a color-based value determined for each color of a plurality of colors in a portion of the unknown video content, the second fingerprint being a patch-based fingerprint that includes a patch-based value determined for each patch of a plurality of patches in the portion of the unknown video content, wherein each patch-based value is based on at least one of luminance, optical flow, or a Haar-like feature;
initiating a query of at least one reference database of reference fingerprints using one of the first fingerprint to obtain a candidate group of fingerprints or the second fingerprint to obtain the candidate group of fingerprints, the candidate group of fingerprints being a subset of the reference fingerprints queried; and
responsive to the query, initiating a query of the candidate group of fingerprints using the other of the first fingerprint and the second fingerprint to identify at least one query fingerprint, wherein the unknown video content is identified using the at least one query fingerprint.

US Pat. No. 10,313,748

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON LOCATION

Gracenote, Inc., Emeryvi...

1. A non-transitory computer readable medium having stored thereon instructions executable by a processing unit to cause a computing system to carry out operations comprising:receiving from a media presentation device a communication providing a digital fingerprint of media content being rendered by the media presentation device;
receiving from the media presentation device an indication of location of the media presentation device;
determining that the digital fingerprint of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
responsive to determining that the digital fingerprint matches the multiple reference fingerprints each corresponding with a different respective channel, performing disambiguation based at least in part on the received indication of location of the media presentation device, to determine which channel that the media presentation device is receiving the media content on, wherein performing the disambiguation comprises, based at least in part on a match between the location of the media presentation device and a location associated with a particular one of the multiple reference fingerprints, determining that the respective channel with which the particular reference fingerprint corresponds is the channel that the media presentation device is receiving media content on; and
taking action based on the determined channel.

US Pat. No. 10,149,007

MEDIA CHANNEL IDENTIFICATION WITH VIDEO MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON AUDIO FINGERPRINT

Gracenote, Inc., Emeryvi...

1. A method comprising:determining, by a computing system, that a digital video fingerprint of media content being rendered by a media presentation device matches multiple reference video fingerprints each corresponding with a different respective channel;
responsive to at least determining that the digital video fingerprint matches the multiple reference video fingerprints each corresponding with a different respective channel, performing disambiguation by the computing system based at least in part on a determination that a digital audio fingerprint of the media content being rendered by the media presentation device matches a reference audio fingerprint corresponding with just a single channel of the channels with which the multiple reference video fingerprints correspond, the disambiguation establishing that the media presentation device is receiving the media content on the single channel; and
taking action by the computing system based on the establishing that the media presentation device is receiving the media content on the single channel.

US Pat. No. 10,713,296

AUDIO IDENTIFICATION BASED ON DATA STRUCTURE

GRACENOTE, INC., Emeryvi...

1. A computerized method comprising:accessing, using one or more hardware processors, reference audio to be represented by a reference data structure to be generated and stored in a reference database;
generating, using the one or more hardware processors, the reference data structure from the reference audio by at least:
performing a constant Q transform on multiple reference time slices of the reference audio;
binarizing the constant Q transformed reference time slices of the reference audio;
performing a two-dimensional Fourier transform on multiple reference time windows within the binarized and constant Q transformed reference time slices of the reference audio to obtain two-dimensional Fourier transforms of the reference time windows; and
sequentially ordering the two-dimensional Fourier transforms of the reference time windows in the reference data structure;
creating, within the reference database, a data association between the reference audio and the generated reference data structure that includes the sequentially ordered two-dimensional Fourier transforms of the reference time windows, the created data association indicating that the reference data structure is an identifier of the reference audio;
accessing query audio to be represented by a query data structure for comparison to the reference data structure that represents the reference audio; and
generating the query data structure from the query audio by at least:
performing a constant Q transform on multiple query time slices of the query audio;
binarizing the constant Q transformed query time slices of the query audio by, for each constant Q transformed query time slice, calculating a median value of a range of constant Q transformed query time slices that encompasses the constant Q transformed query time slice and binarizing the constant Q transformed query time slice based on the calculated median value of the range;
performing a two-dimensional Fourier transform on multiple query time windows within the binarized and constant Q transformed query time slices of the query audio to obtain two-dimensional Fourier transforms of the query time windows; and
sequentially ordering the two-dimensional Fourier transforms of the query time windows in the query data structure.

US Pat. No. 10,631,049

MEDIA CHANNEL IDENTIFICATION WITH VIDEO MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON AUDIO FINGERPRINT

Gracenote, Inc., Emeryvi...

1. A media presentation device comprising:a media input interface through which to receive media content to be presented by the media presentation device;
a media presentation interface for presenting the media content;
a network communication interface;
a processing unit;
non-transitory data storage; and
program instructions stored in the non-transitory data storage and executable by the processing unit to carry out operations including:
generating digital video fingerprint data representing the media content, based on analysis of the media content, and outputting the generated digital video fingerprint data for transmission through the network communication interface to a server,
after outputting the digital video fingerprint data for transmission to the server, receiving from the server a request for audio fingerprint data representing the media content, the request being responsive to the server determining that the digital video fingerprint data matches reference video fingerprint data corresponding with multiple channels,
responsive to the request, outputting digital audio fingerprint data representing the media content, for transmission of the digital audio fingerprint data through the network communication interface to the server, and
causing the media presentation device to present, in conjunction with the media content, supplemental channel-specific content associated with one of the multiple channels, the one channel being identified from among the multiple channels based on a determination that the digital audio fingerprint data matches reference audio fingerprint data representing just the one channel of the multiple channels.

US Pat. No. 10,599,706

RETRIEVING AND PLAYING OUT MEDIA CONTENT FOR A PERSONALIZED PLAYLIST

Gracenote Digital Venture...

1. A method comprising:receiving, by a client device, a playlist defining a sequence of media content items including a first media content item and a second media content item, wherein the first and second media content items are immediately adjacent to each other in the defined sequence, wherein in the sequence, the first media content item is immediately followed by the second media content item;
retrieving, by the client device, from one or more server devices, first data representing the media content items of the received playlist, and using the retrieved first data to play out the media content items; and
while playing out at least one of the media content items represented by the retrieved first data:
making, by the client device, a determination of whether a condition has been satisfied, wherein the condition comprises a condition that a content category type has a particular relationship with at least one of the media content items that has been played out, and wherein the condition further comprises a condition that a particular time is within a particular predetermined time period range and that the content category type is associated with the particular predetermined time period range; and
responsive to the determination being that the condition has been satisfied, (i) retrieving, by the client device, from the one or more server devices, second data representing a media content item that has the content category type and that is not specified in the accessed playlist, and (ii) using, by the client device, the retrieved second data to play out the media content item of the content category type in between playing out the first and second media content items.

US Pat. No. 10,595,053

METHOD AND DEVICE FOR GENERATING AND DETECTING A FINGERPRINT FUNCTIONING AS A TRIGGER MARKER IN A MULTIMEDIA SIGNAL

Gracenote, Inc., Emeryvi...

1. A system comprising:a memory that stores instructions; and
one or more processors configured by the instructions to perform operations comprising:
deriving fingerprints from respective segments of multimedia content while the multimedia content is being played by a multimedia playback device;
comparing the derived fingerprints to one or more reference fingerprints representing features of the multimedia content, wherein at least one reference fingerprint of the one or more reference fingerprints is associated with an action;
determining that one of the derived fingerprints matches the at least one reference fingerprint; and
in response to determining that the one of the derived fingerprints matches the at least one reference fingerprint, causing execution of the action associated with the at least one reference fingerprint,
wherein the action to be executed is associated with a time point indicating when, in the multimedia content, the action is to be executed.

US Pat. No. 10,503,998

RECURRENT DEEP NEURAL NETWORK SYSTEM FOR DETECTING OVERLAYS IN IMAGES

Gracenote, Inc., Emeryvi...

1. A method comprising:a processor applying a feature map network to an image to create a feature map, wherein the feature map comprises a grid of vectors characterizing at least one feature in the image;
the processor applying a probability map network to the feature map to create a probability map assigning a probability to the at least one feature in the image, wherein the assigned probability corresponds to a likelihood that the at least one feature is an overlay;
the processor determining that the assigned probability exceeds a threshold probability value; and
responsive to the processor determining that the assigned probability exceeds the threshold probability value, the processor (i) removing the at least one feature from the image or (ii) inpainting the at least one feature.

US Pat. No. 10,419,508

SAVING MEDIA FOR IN-AUTOMOBILE PLAYOUT

Gracenote Digital Venture...

1. A method comprising:receiving, by a server device, an indication of a text-based article selected by way of a first client device, wherein the indication specifies that the text-based article has been flagged for audible playout at a later time;
based on the indication specifying that the text-based article has been flagged for audible playout at the later time, causing, by the server device, the text-based article to be converted into an audio file;
receiving, by the server device, a request to stream the audio file to a second client device, wherein the second client device is associated with the first client device; and
causing, by the server device, the audio file to be streamed to the second client device.

US Pat. No. 10,347,295

COMPUTING SYSTEM WITH VIDEO CONTENT GENERATION FEATURE

Gracenote, Inc., Emeryvi...

1. A method comprising:receiving, by a computing system, first video content captured by a front-facing camera of an end-user device;
receiving, by the computing system, second video content captured by a rear-facing camera of the end-user device, wherein the first video content is captured by the front-facing camera while the second video content is captured by the rear-facing camera;
accessing, by the computing system, a program schedule of a video program;
based on the accessed program schedule, selecting, by the computing system, a digital video effect (DVE) template for generating third video content that includes the received first video content and the received second video content; and
using, by the computing system, the received first video content, the received second video content, and the selected DVE template to generate the third video content that includes the received first video content and the received second video content.

US Pat. No. 10,108,619

STATION LIBRARY CREATON FOR A MEDIA SERVICE

Gracenote, Inc., Emeryvi...

1. A method comprising:accessing seed metadata, the seed metadata describing a seed, the seed being a basis on which a station library is to be defined;
generating a station descriptor profile based on the seed metadata, the station descriptor profile defining a genre composition of the station library;
generating a candidate set based on the seed metadata, the candidate set comprising a plurality of candidate media files;
for each candidate media file in the candidate set:
computing a similarity score associated with the candidate media file, the similarity score including a measure of similarity between the candidate media file and the station descriptor profile;
computing one or more boost values associated with the candidate media file based on candidate metadata describing the candidate media file; and
computing a relevancy score associated with the candidate media file based on the similarity score and the one or more boost values associated with the candidate media file, the relevancy score providing a basis for selecting the candidate media file for inclusion in the station library; and
machine-generating a station set including a portion of the candidate set selected based on the relevancy scores associated with the plurality of candidate media files included in the candidate set, the machine-generated station set defining the station library by referencing each candidate media file in the portion of the candidate set,
wherein the station descriptor profile includes one or more focus genre profiles and each candidate media file includes a file genre profile, the one or more focus genre profiles and the file genre profile each including respective multiple genre values and a weight assigned to each genre value, each genre value corresponding to a genre, each weight indicating a percentage of the corresponding genre value relative to the other genre values in the respective multiple genre values, and
computing the similarity score associated with the candidate media file includes:
computing one or more focus-level similarity scores by comparing, for each focus genre profile in the station descriptor profile: (a) the respective multiple genre values and the corresponding weights of the focus genre profile, and (b) the respective multiple genre values and the corresponding weights of the file genre profile of the candidate media file; and
selecting the highest focus-level similarity score to be the similarity score associated with the candidate media file.

US Pat. No. 10,045,073

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON TIME OF BROADCAST

Gracenote, Inc., Emeryvi...

1. A method comprising:determining, by a computing system, that a fingerprint of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
responsive to determining that the fingerprint matches the multiple reference fingerprints each corresponding with a different respective channel, performing disambiguation based at least in part on a time of broadcast of the media content being rendered by the media presentation device, to determine which channel the media presentation device is receiving the media content on; and
taking action based on the determined channel,
wherein performing disambiguation based at least in part on the time of broadcast of the media content being rendered by the media presentation device comprises determining the time of broadcast of the media content being rendered by the media presentation device and using the determined time of broadcast of the media content being rendered by the media presentation device as a basis to determine which one of the matching reference fingerprints corresponds with the channel on which the media presentation device is receiving the media content,
wherein each of the matching reference fingerprints has a corresponding time of broadcast, and wherein using the determined time of broadcast of the media content being rendered by the media presentation device as a basis to determine which one of the matching reference fingerprints corresponds with the channel on which the media presentation device is receiving the media content comprises (i) comparing the determined time of broadcast of the media content being rendered by the media presentation device with each matching reference fingerprint's corresponding time of broadcast, and (ii) determining based on the comparing that the time of broadcast of the media content being rendered by the media presentation device matches just the time of broadcast corresponding with the one matching reference fingerprint.

US Pat. No. 9,959,343

GENERATING AND DISTRIBUTING A REPLACEMENT PLAYLIST

Gracenote, Inc., Emeryvi...

1. A method comprising:generating, by a server device, a first playlist for a client device, wherein the first playlist includes references to a first plurality of audio files in a first specified order, wherein playout of the first plurality of audio files has a first duration, wherein the client device and the server device are in communication with one another by way of a wide-area network, and wherein the client device includes an audio player application;
transmitting, by the server device over the wide area network, the first playlist to the client device, wherein reception of the first playlist at the client device causes the audio player application to retrieve and play out at least some of the first plurality of audio files in the first specified order;
while the client device is playing out a current audio file of the first plurality of audio files, receiving, by the server device, an instruction from the client device, wherein the instruction specifies that the client device has activated a sleep timer with a target duration;
in response to receiving the instruction, generating, by the server device, a second playlist, wherein the second playlist includes references to a second plurality of audio files in a second specified order, wherein playout of the second plurality of audio files has a second duration that is: less than the first duration and within a threshold duration of the target duration; and
transmitting, by the server device over the wide area network, the second playlist to the client device, wherein reception of the second playlist at the client device causes the audio player application to, after playout of the current audio file completes, stop playing out audio files from the first plurality of audio files and retrieve and play out the second plurality of audio files in the second specified order.

US Pat. No. 10,606,879

INDEXING FINGERPRINTS

Gracenote, Inc., Emeryvi...

1. A system comprising:one or more processors;
a database storing reference fingerprints; and
memory including instructions that, when executed, cause the one or more processors to at least:
obtain a query fingerprint including a plurality of sub-fingerprints;
divide the reference fingerprints into segments, each segment includes a plurality of reference sub-fingerprints;
cause a plurality of threads to determine first bit error rates by performing, in parallel, one or more operations including determining one of the first bit error rates by comparing a portion of the plurality of sub-fingerprints of the query fingerprint with a portion of the plurality of reference sub-fingerprints of a segment;
identify a first predetermined number of lowest bit error rates based on the first bit error rates, each of the first predetermined number of lowest bit error rates corresponding to one of the reference fingerprints;
in response to identifying the first predetermined number of lowest bit error rates based on the first bit error rates, for each of the first predetermined number of lowest bit error rates, determine a second bit error rate by evaluating the query fingerprint against the corresponding one of the reference fingerprints; and
provide a predetermined number of response candidates based on the second bit error rate determined for each of the first predetermined number of lowest bit error rates.

US Pat. No. 10,007,863

LOGO RECOGNITION IN IMAGES AND VIDEOS

Gracenote, Inc., Emeryvi...

1. A method to detect a logo in images in video frames selected from a video stream, comprising:applying a saliency analysis and segmentation of selected regions in a selected video frame to determine segmented likely logo regions;
processing the segmented likely logo regions with feature matching using correlation to generate a first match, neural network classification using a convolutional neural network to generate a second match, and text recognition using character segmentation and string matching to generate a third match; and
deciding a most likely logo match by combining results from the first match, the second match, and the third match.

US Pat. No. 10,715,841

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICES

Gracenote, Inc., Emeryvi...

1. A method for use in connection with a client device and a sequence of media content that includes a first portion followed by a second portion, the method comprising:presenting, by the client device, the first portion of the sequence of media content;
performing, by the client device, a content replacement operation, wherein performing the content replacement operation comprises presenting replacement media content instead of the second portion of the sequence of media content;
while performing the content replacement operation:
(i) monitoring, by the client device, advancement of the second portion of the sequence of media content, for steganographic data included in the second portion of the sequence of media content, and
(ii) based on the monitoring, determining, by the client device, that the second portion of the sequence of media content was interrupted by interrupting media content; and
responsive to determining that the second portion of the sequence of media content was interrupted by interrupting media content, switching, by the client device, from presenting the replacement media content to presenting the interrupting media content instead.

US Pat. No. 10,714,105

AUDIO FINGERPRINTING

Gracenote, Inc., Emeryvi...

1. An apparatus comprising:a vector generator to:
determine first and second groups of frequencies in a plurality of frequencies from spectral data derived from audio data, the first group including frequencies different from frequencies in the second group of frequencies, each of the frequencies of the first group being higher than each of the frequencies in the second group,
identify a first subgroup of frequencies in the first group of frequencies based on energy values of the first group, each of the frequencies of the first subgroup having energy values that are greater than energy values of other frequencies in the first group,
identify a second subgroup of frequencies in the second group of frequencies based on energy values of the second group, each of the frequencies of the second subgroup having energy values that are greater than energy values of other frequencies in the second group, and
generate a vector that assigns a first value to the frequencies in the first subgroup and assigns a second value to the frequencies in the second subgroup;
a scrambler to generate permutations of the vector, the permutations differently arranging instances of the first and second values;
a coder to generate a sequence that indicates an instance of the first value or of the second value within a corresponding permutation of the permutations; and
a fingerprint generator to generate a fingerprint of the audio data based on the sequence, wherein the generation and decoding of the fingerprint is to conserve computing resources.

US Pat. No. 10,506,291

METHOD AND SYSTEM FOR PRESENTING ADDITIONAL CONTENT AT A MEDIA SYSTEM

Gracenote, Inc., Emeryvi...

1. A method comprising:receiving, by a server and from a client device, a request for replacement media content, wherein the request includes a first comparison fingerprint;
determining, by the server, that a plurality of reference fingerprints from a database match the first comparison fingerprint, wherein the plurality of reference fingerprints are associated with respective items of reference content;
receiving, by the server and from the client device, a second comparison fingerprint;
based on a quantity of the plurality of reference fingerprints that match the first comparison fingerprint being greater than one, comparing, by the server, the second comparison fingerprint to a second plurality of reference fingerprints, wherein the second plurality of reference fingerprints are generated from the respective items of reference content;
identifying, by the server and based on the comparing, a sequence of media content corresponding to the first comparison fingerprint and the second comparison fingerprint, wherein identifying the sequence of media content comprises identifying a reference time within the sequence of media content;
selecting, by the server, a sequence of additional media content for display in place of a portion of the sequence of media content in accordance with predefined criteria; and
providing, by the server to the client device, a response to the request for replacement media content, wherein the response includes information enabling the sequence of additional media content to be displayed in place of the portion of the sequence of media content at the client device at a subsequent time that is defined relative to the reference time.

US Pat. No. 10,462,512

MUSIC SERVICE WITH MOTION VIDEO

Gracenote, Inc., Emeryvi...

1. A computer-implemented method comprising:receiving primary audio content;
determining, by at least one hardware processor, that at least one reference audio content satisfies a predetermined similarity threshold based on a comparison of the primary audio content with the at least one reference audio content;
for each one of the at least one reference audio content, identifying motion video content based on the motion video content being stored in association with the one of the at least one reference audio content and not stored in association with the primary audio content; and
causing the identified motion video content to be displayed on a device concurrently with a presentation of the primary audio content on the device by (i) synchronizing data of the at least one reference audio content with data of the primary audio content and (ii) synchronizing the identified motion video content with the primary audio content based on the synchronizing of the data of the at least one reference audio content with the data of the primary audio content.

US Pat. No. 10,397,651

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON SINGLE-MATCH

Gracenote, Inc., Emeryvi...

1. A non-transitory computer readable medium having stored thereon instructions executable by a processing unit to carry out operations comprising:determining that a digital fingerprint of a first segment of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
after determining that the digital fingerprint of the first segment matches the multiple reference fingerprints each corresponding with a different respective channel, performing disambiguation based at least in part on an earlier determination that a digital fingerprint of a second segment of the media content being rendered by the media presentation device matches a reference fingerprint corresponding with just a single channel, the disambiguation establishing that the media presentation device is receiving the media content on the single channel, wherein the first and second segments of the media content being rendered by the media presentation device are first and second respective time segments of the media content being rendered by the media presentation device, wherein the second time segment is before the first time segment, and
taking action based on the establishing that the media presentation device is receiving the media content on the single channel.

US Pat. No. 10,362,094

RETRIEVAL AND PLAYOUT OF MEDIA CONTENT

Gracenote Digital Venture...

1. A method comprising:accessing, by a client device, a playlist defining a sequence of media content items including a first media content item and a second media content item, wherein the first and second media content items are immediately adjacent to each other in the defined sequence;
using the accessed playlist, requesting and receiving, by the client device, from one or more server devices, first data representing the media content items of the accessed playlist, and using the received first data to playout the media content items; and
while playing out at least one of the media content items represented by the received first data:
(a) determining, by the client device, a status of an event, and using, by the client device, the determined status of the event as a basis to determine a condition, wherein the condition corresponds to a relationship between a content category type and respective content category types of at least one of the media content items that has been played out;
(b) making, by the client device, a determination that the determined condition has been satisfied; and
(c) responsive to making the determination that the determined condition has been satisfied, (i) requesting and receiving, by the client device, from the one or more server devices, second data representing a media content item that is related to the event and that is not specified in the accessed playlist, and (ii) using, by the client device, the retrieved second data to playout the media content item related to the event in between playing out the first and second media content items.

US Pat. No. 10,360,905

ROBUST AUDIO IDENTIFICATION WITH INTERFERENCE CANCELLATION

Gracenote, Inc., Emeryvi...

1. A method of content identification, the method comprising:identifying, by a processor and in a query, a first segment of query signatures in a sliding window that has a high match score with reference signatures of a reference and a second segment of query signatures in the sliding window that has a low match score with the reference signatures, wherein the second segment represents a distorted segment, wherein the query is based on an audio signal captured by a microphone;
evaluating, by the processor, query matches using the first segment with the high match score and not using the second segment with the low match score;
outputting, by the processor, a matching reference and match scores; and
performing a second analysis on a remote server storing more information on the reference than is stored on a portable smart device performing the method and having an acoustic profile of the query sent by the portable smart device to the server to identify different and finer segments for interfering sources.

US Pat. No. 10,229,689

AUDIO FINGERPRINTING

Gracenote, Inc., Emeryvi...

1. A method comprising:accessing, by executing an instruction with at least one processor, spectral data stored in a database, the spectral data being derived from audio data and indicating a separate energy value for ones of a plurality of frequencies;
determining, by executing an instruction with the at least one processor, from the spectral data, a first group of frequencies and a second group of frequencies in the plurality of frequencies, the first group including frequencies that are higher than frequencies in the second group of frequencies;
in the first group of frequencies, identifying, by executing an instruction with the at least one processor, a first subgroup of frequencies wherein frequencies in the first subgroup have energy values that are higher than energy values of other frequencies in the first group;
in the second group of frequencies, identifying, by executing an instruction with the at least one processor, a second subgroup of frequencies wherein frequencies in the second subgroup have energy values that are higher than energy values of other frequencies in the second group;
creating, by executing an instruction with the at least one processor, a vector that assigns a first common value to frequencies in the first subgroup and assigns a second common value to frequencies in the second subgroup;
generating, by executing an instruction with the at least one processor, a sequence of permutations of the vector, the permutations differently arranging instances of the first and second common values;
generating, by executing an instruction with the at least one processor, a sequence of numbers that indicate a position of an instance of the first common value or of the second common value within a corresponding permutation among the permutations; and
reducing a computational overhead by generating, by executing an instruction with the at least one processor, a fingerprint of the audio data based on the sequence of numbers.

US Pat. No. 10,706,099

GENERATING AND DISTRIBUTING PLAYLISTS WITH MUSIC AND STORIES HAVING RELATED MOODS

Gracenote, Inc., Emeryvi...

1. A method comprising:obtaining, by a server device, an audio file containing music, wherein a client device and the server device are in communication with one another by way of a wide-area network;
determining, by the server device, a mood of the music from a plurality of moods;
looking up, by the server device, the mood of the music in a table, wherein the table identifies, for the mood of the music, other moods from the plurality of moods including those that match the mood of the music, are opposed to the mood of the music, and are neutral with respect to the mood of the music;
selecting, by the server device, an audio file containing a story, wherein the audio file containing the story is associated with a story attribute indicative of a second mood of the plurality of moods, wherein the second mood matches the mood of the music or is neutral with respect to the mood of the music;
automatically generating, by the server device, a playlist for the client device, wherein the playlist sequentially includes (i) reference to the audio file containing the music, and (ii) a reference to the audio file containing the story; and
transmitting, by the server device over the wide-area network, the playlist to the client device.

US Pat. No. 10,629,239

COMPUTING SYSTEM WITH DVE TEMPLATE SELECTION AND VIDEO CONTENT ITEM GENERATION FEATURE

Gracenote, Inc., Emeryvi...

1. A non-transitory computer-readable medium having stored thereon program instructions that, upon execution by a processor, cause performance of a set of acts comprising:receiving, by a computing system, a first group of video content items;
identifying, by the computing system, from among the first group of video content items, a second group of video content items having a threshold extent of similarity with each other;
determining, by the computing system, a quality score for each video content item of the second group;
identifying, by the computing system, from among the second group of video content items, a third group of video content items each having a quality score that exceeds a quality score threshold; and
based on the identifying of the third group, using, by the computing system, at least a portion of at least one video content item of the identified third group and a digital video effect (DVE) template to generate a video content item.

US Pat. No. 10,567,835

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON SINGLE-MATCH

Gracenote, Inc., Emeryvi...

1. A non-transitory computer readable medium having stored thereon instructions executable by a processing unit to carry out operations comprising:determining that a digital fingerprint of a first segment of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
after determining that the digital fingerprint of the first segment matches the multiple reference fingerprints each corresponding with a different respective channel, performing disambiguation based at least in part on an earlier determination that a digital fingerprint of a second segment of the media content being rendered by the media presentation device matches a reference fingerprint corresponding with just a single channel, the disambiguation establishing that the media presentation device is receiving the media content on the single channel, wherein the first and second segments of the media content being rendered by the media presentation device are first and second respective time segments of the media content being rendered by the media presentation device, wherein the second time segment is before the first time segment, and
taking action based on the establishing that the media presentation device is receiving the media content on the single channel.

US Pat. No. 10,440,430

MEDIA CHANNEL IDENTIFICATION WITH VIDEO MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON AUDIO FINGERPRINT

Gracenote, Inc., Emeryvi...

1. A media presentation device comprising:a media input interface through which to receive media content to be presented by the media presentation device;
a media presentation interface for presenting the media content;
a network communication interface;
a processing unit;
non-transitory data storage; and
program instructions stored in the non-transitory data storage and executable by the processing unit to carry out operations including:
generating a digital video fingerprint of the media content based on analysis of the media content, and outputting the generated digital video fingerprint for transmission through the network communication interface to a server,
after outputting the digital video fingerprint for transmission to the server, receiving from the server a request for an audio fingerprint of the media content when the server has determined that the digital video fingerprint matches reference video fingerprints corresponding with multiple channels,
responsive to the request, outputting a digital audio fingerprint of the media content for transmission through the network communication interface to the server, and
causing the media presentation device to present, in conjunction with the media content, supplemental channel-specific content associated with one of the multiple channels, the one channel being identified from among the multiple channels based on a determination that the digital audio fingerprint matches a reference audio fingerprint of just the one channel of the multiple channels.

US Pat. No. 10,341,697

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICES

Gracenote, Inc., Emeryvi...

1. A method comprising:accessing, at a media system, a sequence of media content;
accessing, at the media system, replacement media content, wherein the replacement media content is selected based on at least a portion of the sequence of media content; and
causing presentation of a displayed sequence of media content that includes at least a portion of the sequence of media content and at least a portion of the replacement media content, the presentation of the displayed sequence of media content including:
interrupting the at least the portion of the replacement media content by causing presentation of a further sequence of media content in response to a first request received during the presentation of the at least the portion of the replacement media content, the further sequence of media content being presented without presenting the replacement media content;
if a second request is received within a predetermined time of the first request, resuming presentation of the replacement media content in response to the second request; and
if the second request is received outside the predetermined time of the first request, presenting the sequence of media content responsive to the second request,
wherein the predetermined time is related to a duration of the replacement media content.

US Pat. No. 10,277,942

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON LOCATION

Gracenote, Inc., Emeryvi...

1. A non-transitory computer readable medium having stored thereon instructions executable by a processing unit to cause a computing system to carry out operations comprising:receiving from a media presentation device a communication providing a digital fingerprint of media content being rendered by the media presentation device;
receiving from the media presentation device an indication of location of the media presentation device;
determining that the digital fingerprint of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
responsive to determining that the digital fingerprint matches the multiple reference fingerprints each corresponding with a different respective channel, performing disambiguation based at least in part on the received indication of location of the media presentation device, to determine which channel that the media presentation device is receiving the media content on, wherein performing the disambiguation comprises, based at least in part on a match between the location of the media presentation device and a location associated with a particular one of the multiple reference fingerprints, determining that the respective channel with which the particular reference fingerprint corresponds is the channel that the media presentation device is receiving media content on; and
taking action based on the determined channel.

US Pat. No. 10,261,963

GENERATING AND DISTRIBUTING PLAYLISTS WITH RELATED MUSIC AND STORIES

Gracenote, Inc., Emeryvi...

1. A method comprising:based on a profile associated with a client device, selecting, by a server device, an audio file containing music, wherein the client device and the server device are in communication with one another by way of a wide-area network, wherein the client device includes an audio player application, wherein the profile associated with the client device indicates a preferred language, and wherein the audio file containing the music is selected because at least some singing therein is in the preferred language;
identifying, by the server device, an attribute associated with the audio file containing the music, wherein the attribute is a keyword from lyrics associated with the music;
selecting, by the server device, a text file containing the keyword;
converting, by an automatic text-to-speech system, the text file containing the keyword to an audio file containing a story;
automatically generating, by the server device, a playlist for the client device, wherein the playlist sequentially includes (i) a reference to the audio file containing the music, and (ii) a reference to the audio file containing the story; and
transmitting, by the server device over the wide: area network, the playlist to the client device, wherein reception of the playlist at the client device causes the audio player application to sequentially retrieve and play out each of the audio file containing the music and the audio file containing the story.

US Pat. No. 10,261,964

GENERATING AND DISTRIBUTING PLAYLISTS WITH MUSIC AND STORIES HAVING RELATED MOODS

Gracenote, Inc., Emeryvi...

1. A method comprising:obtaining, by a server device, an audio file containing a story, wherein a client device and the server device are in communication with one another by way of a wide-area network, and wherein the client device includes an audio player application;
based on presence of pre-determined keywords or keyphrases within the story, determining, by the server device, a mood of the story;
looking up, by the server device, the mood of the story in a table, wherein the table identifies, for the mood of the story, a plurality of other moods including those that match the mood of the story, are opposed to the mood of the story, and are neutral with respect to the mood of the story;
selecting, by the server device, an audio file containing music, wherein the audio file containing the music is associated with a music attribute that is indicative of a second mood of the other moods, wherein the second mood matches the mood of the story or is neutral with respect to the mood of the story;
automatically generating, by the server device, a playlist for the client device, wherein the playlist sequentially includes (i) a uniform resource locator (URL) reference to the audio file containing the music, and (ii) a URL reference to the audio file containing the story; and
transmitting, by the server device over the wide area network, the playlist to the client device, wherein reception of the playlist at the client device causes the audio player application to sequentially retrieve and play out each of the audio file containing the music and the audio file containing the story.

US Pat. No. 10,735,782

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICES

Gracenote, Inc., Emeryvi...

1. A method comprising:receiving, at a media system, a sequence of media content from a media content distributor over a communication network, the sequence of media content including interstitial advertisements, and the sequence of media content being watermarked with control data that includes timing information of when in the sequence of media content at least one replacement advertisement should be presented in place of at least one of the interstitial advertisements, wherein at least a portion of the watermarked control data includes encrypted control data;
outputting, by the media system, the sequence of media content for presentation;
while outputting the sequence of media content for presentation, (i) detecting, by the media system, the watermarked control data in the sequence of media content and (ii) determining, by the media system, from the detected watermarked control data, the timing information of when in the sequence of media content the at least one replacement advertisement should be presented;
responsive to detecting the watermarked control data in the sequence of media content, transmitting from the media system to a content-replacement system a replacement-content request, including in the replacement-content request the encrypted control data without decrypting the encrypted control data;
receiving at the media system, in response to the replacement-content request, the at least one replacement advertisement; and
causing, by the media system, the at least one replacement advertisement to be presented in accordance with the determined timing information, in place of content of the sequence of media content.

US Pat. No. 10,698,948

AUDIO MATCHING BASED ON HARMONOGRAM

GRACENOTE, INC., Emeryvi...

1. An apparatus comprising:accessing means to access audio data that represents query sound to be identified;
first creating means to create a spectrogram of the audio data, different segments of the spectrogram representing amplitudes at frequencies in different time slices of the query sound;
determining means to determine a dominant frequency in a time slice of the query sound based on a segment of the spectrogram, the determining means including:
calculating means to calculate an aggregate energy value of a candidate frequency based on amplitudes of the candidate frequency and harmonics thereof represented in the segment of the spectrogram; and
identifying means to identify the candidate frequency as the dominant frequency based on the aggregate energy value of the candidate frequency being a largest aggregate energy value among aggregate energy values of frequencies whose amplitudes are represented in the segment of the spectrogram;
second creating means to create a query harmonogram of the audio data, different segments of the query harmonogram representing aggregate energy values of dominant frequencies in different time slices of the query sound; and
providing means to provide an identifier of the query sound based on a comparison of the query harmonogram to a reference harmonogram mapped to the identifier by a database.

US Pat. No. 10,613,821

MACHINE-LED MOOD CHANGE

Gracenote, Inc., Emeryvi...

1. A system comprising:a sensor in a vehicle to obtain a biometric measurement of a driver of the vehicle;
a play count accessor to access play counts that quantify playback of media files; and
a media selector machine to:
compare play counts of the media files to a threshold number of playbacks;
categorize respective ones of the media files as familiar media files when the corresponding play counts satisfy the threshold number of playbacks;
categorize respective ones of the media files as unfamiliar media files when the corresponding play counts do not satisfy the threshold number of playbacks;
access user state scores of the of media files;
select a target user state score based on the biometric measurement of the driver; and
select, as a selected media file, one of the media files to present to the driver based on the user state score of the selected media file, the target user state score, and a status of the selected media file as a familiar media file or an unfamiliar media file.

US Pat. No. 10,373,611

MODIFICATION OF ELECTRONIC SYSTEM OPERATION BASED ON ACOUSTIC AMBIENCE CLASSIFICATION

Gracenote, Inc., Emeryvi...

1. A method, comprising:detecting, using a microphone, at least one audio signal present in a physical environment while a media player is playing a song having a genre, the at least one audio signal including first audio data corresponding to the song, second audio data associated with a person exposed to the song, and third audio data associated with noise in the physical environment different than the first audio data and the second audio data;
cancelling, using the least one hardware processor, the third audio data from the at least one audio signal to generate at least one filtered audio signal;
analyzing the at least one filtered audio signal to extract at least one audio feature from the at least one audio signal;
classifying, by at least one hardware processor, the at least one filtered audio signal based on the at least one audio feature to produce a classification of the at least one filtered audio signal, the classification including a reaction of the person to the media content being played by the media player; and
based on the genre and the reaction, automatically modifying, using the least one hardware processor, operation of the media player by modifying a playlist of media content played by the media player to include more songs of the genre.

US Pat. No. 10,229,198

VIDEO MATCHING SERVICE TO OFFLINE COUNTERPART

Gracenote, Inc., Emeryvi...

1. A method comprising:receiving an address for an online media asset, the online media asset including at least one of audio or video;
receiving a metadata associated with the online media asset, wherein the metadata includes an asset attribute;
identifying whether the asset attribute is a strong signal attribute, wherein strong signal attributes include a list of cast or a list of crew; and
using the metadata at least in part by prioritizing strong signal attributes over weaker signal attributes to match the online media asset to a corresponding offline media asset, the offline media asset including at least one of audio or video.

US Pat. No. 10,104,426

MEDIA CHANNEL IDENTIFICATION AND ACTION WITH MULTI-MATCH DETECTION BASED ON REFERENCE STREAM COMPARISON

Gracenote, Inc., Emeryvi...

1. A method comprising:obtaining, by a computing system, reference fingerprints that represent reference media streams, wherein each reference fingerprint represents a respective one of the reference media streams, and wherein each reference media stream is broadcast on a respective channel;
detecting by the computing system, based on a comparison of the reference fingerprints with each other, that a plurality of the reference fingerprints match each other, and responsively flagging the plurality of reference fingerprints as being members of a multi-match group;
thereafter determining by the computing system that a query fingerprint representing a media stream being rendered by a media presentation device matches the multi-match group, wherein the determining that the query fingerprint matches the multi-match group includes (i) finding, based on a comparison of the query fingerprint with a given reference fingerprint, that the query fingerprint matches the given reference fingerprint and then (ii) finding that the given reference fingerprint is flagged as being one of the members of the multi-match group;
responsive to the determining, using by the computing system one or more attributes of the query fingerprint as a basis to determine which reference fingerprint of the multi-match group best matches the query fingerprint and to determine that the channel on which the reference media stream represented by the determined reference fingerprint is broadcast is the channel on which the media stream being rendered by the media presentation device is being broadcast; and
taking action by the computing system based on the determined channel.

US Pat. No. 10,685,058

BROADCAST PROFILING SYSTEM

Gracenote, Inc., Emeryvi...

1. A method comprising:determining first identifying information associated with first broadcast data for a first content station;
analyzing the first identifying information to:
determine at least one of a first genre, first era, or first mood of the first broadcast data associated with a first time context;
store at least one of the first genre, first era, or first mood of the first broadcast data associated with the first time context in a datastore; and
update a first count associated with the first time context;
determining second identifying information associated with second broadcast data for the first content station;
analyzing the second identifying information to:
determine at least one of a second genre, second era, or second mood of the second broadcast data associated with a second time context;
store at least one of the second genre, second era, or second mood of the second broadcast data associated with the second time context in the datastore; and
update a second count associated with the second time context;
generating a content station profile of the first content station based on (1) the stored at least one of the first genre, first era, or first mood of the first broadcast data associated with the first time context and the first count and (2) the stored at least one of the second genre, second era, or second mood of the second broadcast data associated with the second time context and the second count;
comparing a preference included in a user profile with a portion of the content station profile to determine whether the preference included in the user profile satisfies a threshold difference from the portion of the content station profile;
in response to the threshold difference being satisfied, generate a station recommendation for a user associated with the user profile; and
transmitting an instruction to a device associated with the user, the instruction including the station recommendation, the instruction configured to cause a radio pre-set to be adjusted.

US Pat. No. 10,382,817

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON TIME OF BROADCAST

Gracenote, Inc., Emeryvi...

1. A non-transitory computer readable medium having stored thereon instructions executable by a processing unit to carry out operations comprising:determining that a fingerprint of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
responsive to determining that the fingerprint matches the multiple reference fingerprints each corresponding with a different respective channel, performing disambiguation based at least in part on a time of broadcast of the media content being rendered by the media presentation device, to determine which channel the media presentation device is receiving the media content on; and
taking action based on the determined channel.

US Pat. No. 10,311,100

GENERATING AND DISTRIBUTING A REPLACEMENT PLAYLIST

Gracenote, Inc., Emeryvi...

1. A method comprising:receiving, by a client device, a first playlist, wherein the first playlist was sent by a server device over a wide area network, wherein the first playlist includes references to a first plurality of audio files in a first specified order, wherein playout of the first plurality of audio files has a first duration;
in response to receiving the first playlist, causing, by the client device, an audio player application executing on the client device to retrieve and play out at least some of the first plurality of audio files in the first specified order;
while the client device is playing out a current audio file of the first plurality of audio files, receiving, by the client device, an instruction specifying that a sleep timer has been activated with a target duration;
in response to receiving the instruction, transmitting, by the client device to the server device, an indication that the sleep timer has been activated with the target duration;
receiving, by the client device, a second playlist from the server device, wherein the second playlist includes references to a second plurality of audio files in a second specified order, wherein playout of the second plurality of audio files has a second duration that is: less than the first duration and within a threshold duration of the target duration; and
in response to receiving the second playlist, causing, by the client device, the audio player application to, after playout of the current audio file completes, stop playing out audio files from the first plurality of audio files and retrieve and play out the second plurality of audio files in the second specified order.

US Pat. No. 10,313,747

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON TIME OF BROADCAST

Gracenote, Inc., Emeryvi...

1. A non-transitory computer readable medium having stored thereon instructions executable by a processing unit to carry out operations comprising:determining that a fingerprint of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
responsive to determining that the fingerprint matches the multiple reference fingerprints each corresponding with a different respective channel, performing disambiguation based at least in part on a time of broadcast of the media content being rendered by the media presentation device, to determine which channel the media presentation device is receiving the media content on; and
taking action based on the determined channel.

US Pat. No. 10,199,071

COMPUTING SYSTEM WITH DVE TEMPLATE SELECTION AND VIDEO CONTENT ITEM GENERATION FEATURE

Gracenote, Inc., Emeryvi...

1. A method comprising:receiving, by a computing system, a first group of video content items;
determining, by the computing system, a quality score for each video content item of the first group;
identifying, by the computing system, from among the first group of video content items, a second group of video content items each having a quality score that exceeds a quality score threshold;
identifying, by the computing system, from among the second group of video content items, a third group of video content items having a threshold extent of similarity with each other; and
based on the identifying of the third group, transmitting, by the computing system, at least a portion of at least one video content item of the identified third group to a digital video-effect (DVE) system, wherein the DVE system is configured for (i) receiving the transmitted at least the portion of the at least one video content item of the identified third group, and (ii) using (a) the received at least the portion of the at least one video content item of the identified third group and (b) a DVE template, to generate a video content item.

US Pat. No. 10,037,780

COMPUTING SYSTEM WITH VIDEO CONTENT GENERATION FEATURE

Gracenote, Inc., Emeryvi...

1. A method comprising:receiving, by a computing system, first video content captured by a front-facing camera of an end-user device, wherein the first video content comprises video content of a reporter reporting on an event;
receiving, by the computing system, second video content captured by a rear-facing camera of the end-user device, wherein the second video content comprises video content of the event, and wherein the first video content is captured by the front-facing camera while the second video content is captured by the rear-facing camera;
accessing, by the computing system, a program schedule of a video program;
based on the accessed program schedule, selecting, by the computing system, a digital video effect (DVE) template for generating third video content that includes the received first video content and the received second video content; and
using, by the computing system, the received first video content, the received second video content, and the selected DVE template to generate the third video content that includes the received first video content and the received second video content.

US Pat. No. 9,930,406

MEDIA CHANNEL IDENTIFICATION WITH VIDEO MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON AUDIO FINGERPRINT

Gracenote, Inc., Emeryvi...

1. A method comprising:determining, by a computing system, that a digital video fingerprint of media content being rendered by a media presentation device matches multiple reference video fingerprints each corresponding with a different respective channel;
responsive to at least determining that the digital video fingerprint matches the multiple reference video fingerprints each corresponding with a different respective channel, performing disambiguation by the computing system based at least in part on a determination that a digital audio fingerprint of the media content being rendered by the media presentation device matches a reference audio fingerprint corresponding with just a single channel, the disambiguation establishing that the media presentation device is receiving the media content on the single channel, wherein performing disambiguation based at least in part on the determination that the digital audio fingerprint of the media content being rendered by the media presentation device matches the reference audio fingerprint corresponding with just the single channel comprises (i) comparing the digital audio fingerprint of the media content being rendered by the media presentation device with reference audio fingerprints corresponding with the same channels as the multiple reference video fingerprints, and (ii) based on the comparing, determining that the digital audio fingerprint of the media content being rendered by the media presentation device matches just a single one of the reference audio fingerprints; and
taking action by the computing system based on the establishing that the media presentation device is receiving the media content on the single channel.

US Pat. No. 10,402,443

MEDIA FINGERPRINTING AND IDENTIFICATION SYSTEM

Gracenote, Inc., Emeryvi...

1. A method of generating signatures from features in a sequence of frames of a media clip executed on one or more hardware devices, the method comprising:generating a plurality of N-dimensional signatures for features detected in a frame and in a following frame of the media clip;
computing signature bit distances between respective N-dimensional signatures in the frame and in the following frame of the media clip for a robustness analysis;
eliminating N-dimensional signatures from the plurality of N-dimensional signatures that have an associated signature bit distance greater than a first robustness threshold R that are not temporally robust to create a set of remaining N-dimensional signatures that are temporally robust in representing the media clip; and
storing N-dimensional signatures of the set of remaining N-dimensional signatures and an identifier of the media clip at locations in a reference signature database.

US Pat. No. 10,250,916

METHOD AND DEVICE FOR GENERATING AND DETECTING A FINGERPRINT FUNCTIONING AS A TRIGGER MARKER IN A MULTIMEDIA SIGNAL

Gracenote, Inc., Emeryvi...

1. A system comprising:a memory that stores instructions; and
one or more processors configured by the instructions to perform operations comprising:
playing multimedia content by a multimedia playback device;
while playing the multimedia content by the multimedia playback device, deriving fingerprints from respective segments of the multimedia content;
comparing the derived fingerprints to one or more reference fingerprints representing features of the multimedia content, wherein at least one reference fingerprint of the one or more reference fingerprints is associated with an action;
determining that one of the derived fingerprints matches the at least one reference fingerprint; and
in response to determining that the one of the derived fingerprints matches the at least one reference fingerprint, causing execution of the action associated with the at least one reference fingerprint,
wherein the action to be executed is associated with a time point indicating when, in the multimedia content, the action is to be executed.

US Pat. No. 10,223,358

SELECTING BALANCED CLUSTERS OF DESCRIPTIVE VECTORS

Gracenote, Inc., Emeryvi...

1. A method comprising:accessing, by one or more processors, descriptive vectors that describe items, each descriptive vector having multiple dimensions whose values indicate extents to which multiple characteristics are present in a different item among the items;
calculating, by the one or more processors, vector distances between pairs of the descriptive vectors;
generating, by the one or more processors, a hierarchy of vector clusters by clustering the descriptive vectors into the vector clusters based on the calculated vector distances;
determining, by the one or more processors, centroid vectors of the vector clusters in the hierarchy by calculating mean vectors of the vector clusters, each mean vector and each centroid vector representing a different vector cluster in the hierarchy;
summing, by the one or more processors, inter-cluster vector distances between pairs of the centroid vectors;
summing, by the one or more processors, intra-cluster vector distances between pairs of descriptive vectors in each of the vector clusters;
calculating, by the one or more processors, scores of the hierarchy based on the summed inter-cluster vector distances and on the summed intra-cluster vector distances, each of the scores being calculated based on a different scalar among a plurality of scalars among which a scalar corresponds to an extreme score among the calculated scores; and
selecting, by the one or more processors, a subset of the vector clusters in the hierarchy based on the scalar that corresponds to the extreme score.

US Pat. No. 10,740,390

GENERATING AND DISTRIBUTING A REPLACEMENT PLAYLIST

Gracenote, Inc., Emeryvi...

1. A method comprising:receiving, by a client device, a message specifying that a timer has been activated with a target duration, wherein the client device is playing out a current audio file of a first playlist, wherein the first playlist contains references to a first plurality of audio files, and wherein playout of the first plurality of audio files has a first duration;
transmitting, by the client device and to a server device, an indication that the timer has been activated with the target duration;
receiving, by the client device and from the server device, a second playlist containing references to a second plurality of audio files, wherein playout of the second plurality of audio files has a second duration that is: less than the first duration and within a threshold duration of the target duration; and
causing, by the client device, an audio player to, after playout of the current audio file completes, stop playing out audio files from the first plurality of audio files and retrieve and play out the second plurality of audio files.

US Pat. No. 10,567,836

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON SINGLE-MATCH

Gracenote, Inc., Emeryvi...

1. A method comprising:determining by a computing system that digital fingerprint data representing a first segment of media content being rendered by a media presentation device matches multiple instances of reference fingerprint data each corresponding with a different respective channel;
after determining that the digital fingerprint data representing the first segment of media content being rendered by the media presentation device matches the multiple instances of reference fingerprint data each corresponding with a different respective channel, performing by the computing system disambiguation based at least in part on an earlier determination that digital fingerprint data representing a second segment of the media content being rendered by the media presentation device matches reference fingerprint data corresponding with just a single channel, the disambiguation establishing that the media presentation device is receiving the media content on the single channel, wherein the second segment of the media content being rendered by the media presentation device is before the first segment of the media content being rendered by the media presentation device; and
taking action by the computing system based on the establishing that the media presentation device is receiving the media content on the single channel.

US Pat. No. 10,506,275

DYNAMIC CONTROL OF FINGERPRINTING RATE TO FACILITATE TIME-ACCURATE REVISION OF MEDIA CONTENT

Gracenote, Inc., Emeryvi...

1. A method comprising:receiving, by a computing system, query fingerprints representing a media stream being received by a client, wherein the query fingerprints are generated by the client at a frame rate;
determining by a computing system that the media stream being received by the client is a known media stream, wherein the determining is based on a comparison by the computing system of (i) the query fingerprints generated at the frame rate with (ii) reference fingerprints representing the known media stream;
responsive to the determining, causing by the computing system the client to increase the frame rate at which the client generates the query fingerprints being received by the computing system;
using by the computing system the query fingerprints generated at the increased frame rate as a basis to establish synchronous lock between (i) true time defined along a timeline within the known media stream and (ii) client time defined according to a clock of the client;
using by the computing system the established synchronous lock as a basis to determine a client-time point at which the client should perform a content revision in the media stream being received by the client; and
causing the client to perform, at the determined client-time point, the content revision.

US Pat. No. 10,321,188

MEDIA CHANNEL IDENTIFICATION WITH VIDEO MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON AUDIO FINGERPRINT

Gracenote, Inc., Emeryvi...

1. A media presentation device comprising:a media input interface through which to receive media content to be presented by the media presentation device;
a media presentation interface for presenting the media content;
a network communication interface;
a processing unit;
non-transitory data storage; and
program instructions stored in the non-transitory data storage and executable by the processing unit to carry out operations including:
generating a digital video fingerprint of the media content based on analysis of the media content, and outputting the generated digital video fingerprint for transmission through the network communication interface to a server,
after outputting the digital video fingerprint for transmission to the server, receiving from the server a request for an audio fingerprint of the media content when the server has determined that the digital video fingerprint matches reference video fingerprints corresponding with multiple channels,
responsive to the request, outputting a digital audio fingerprint of the media content for transmission through the network communication interface to the server, and
causing the media presentation device to present, in conjunction with the media content, supplemental channel-specific content associated with one of the multiple channels, the one channel being identified from among the multiple channels based on a determination that the digital audio fingerprint matches a reference audio fingerprint of just the one channel of the multiple channels.

US Pat. No. 10,250,919

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICES

Gracenote, Inc., Emeryvi...

1. A method comprising:accessing, by one or more processors of a media system operatively connected to a speaker device, a sequence of media content, wherein the sequence of media content is outputted by the speaker device of the media system;
accessing a fingerprint of a first portion of the sequence of media content;
comparing the fingerprint with a plurality of reference fingerprints stored in a database;
based on the comparison, identifying the sequence of media content and determining a position of the first portion within the sequence of media content;
accessing replacement media content at the media system, wherein the replacement media content is selected based at least in part on the identified sequence of media content;
determining, by the one or more processors, an offset position in the sequence of media content relative to the position of the first portion, the offset position corresponding to a second portion of the sequence of media content; and
causing the replacement media content to be outputted by the speaker device of the media system beginning at the determined offset position.

US Pat. No. 10,623,800

DYNAMIC CONTROL OF FINGERPRINTING RATE TO FACILITATE TIME-ACCURATE REVISION OF MEDIA CONTENT

Gracenote, Inc., Emeryvi...

1. A method comprising:generating query fingerprints representing a media stream being received by a client, wherein the query fingerprints are generated at a frame rate;
providing the generated query fingerprints for use to identify the media stream being received by the client based on comparison with reference fingerprints representing a known media stream;
after identification of the media stream being received by the client, increasing the frame rate of generating the query fingerprints;
providing the query fingerprints generated at the increased frame rate, for use to establish synchronous lock between (i) true-time defined along a timeline within the known media stream and (ii) client-time defined according to a clock of the client; and
performing, at a specific client-time point, a content revision of the media stream being received by the client, wherein the specific client-time point is determined based on the established synchronous lock.

US Pat. No. 10,579,668

MEDIA FINGERPRINTING AND IDENTIFICATION SYSTEM

Gracenote, Inc., Emeryvi...

1. A method comprising:determining, for respective sub-regions of a set of sub-regions that each include pixels of a frame of a query video, using a computing device, respective pixel color data corresponding to pixels included within the respective sub-regions;
comparing, using the computing device, the respective pixel color data for the respective sub-regions with at least one median color value;
determining, for the respective sub-regions, using the computing device, respective descriptor values based on the comparing;
generating, using the computing device, fingerprint data for the frame of the query video based on the respective descriptor values for the respective sub-regions;
providing, by the computing device, the fingerprint data to a server over a network, wherein the server has access to reference fingerprint data of a reference database, and wherein the reference fingerprint data corresponds to a plurality of reference videos and is derived using pixel color data for the plurality of reference videos; and
responsive to providing the fingerprint data to the server, receiving, by the computing device, a message that is generated based on a determination that the query video corresponds to a reference video of the reference database.

US Pat. No. 10,558,674

METHODS AND APPARATUS FOR DETERMINING A MOOD PROFILE ASSOCIATED WITH MEDIA DATA

Gracenote, Inc., Emeryvi...

1. A method comprising:accessing, from a database, a first mood vector that describes first media data and specifies a first mood-value pair, the first mood-value pair assigning a first value to a first mood;
accessing, from the database, a second mood vector that describes a second media data and specifies a second mood-value pair, the second mood-value pair assigning a second value to a second mood;
retrieving a first score that quantifies congruency between the first and second moods, the first score being retrieved from a data structure that correlates the first and second moods;
using a processor, generating a second score that quantifies congruency between the first and second mood vectors, the generating the second score being based on the accessed first and second values and the retrieved first score; and
based on at least the generated second score, providing an indicator to an application.

US Pat. No. 10,475,426

CHARACTERIZING AUDIO USING TRANSCHROMAGRAMS

Gracenote, Inc., Emeryvi...

1. An apparatus comprising:a transchromagram generator to generate a data structure based on a set of transition matrices corresponding to a plurality of time frames of audio data, the data structure indicative of probabilities that first musical notes will transition to second musical notes:
a database controller to prompt a database to store the data structure within the audio data; and
a notification manager to generate, based on a comparison between query audio data and the stored data structure of the audio data, a notification identifying at least one characteristic of the query audio data.

US Pat. No. 10,318,813

DIGITAL VIDEO FINGERPRINTING USING MOTION SEGMENTATION

Gracenote, Inc., Emeryvi...

1. A method of processing video to generate signatures for motion segmented regions over two or more frames, the method comprising:differencing two frames using an adaptive threshold to generate a two-frame difference image;
creating a motion history image from the two-frame difference image;
segmenting the motion history image to generate one or more motion segmented regions;
generating a descriptor and a signature for a selected motion segmented region; and
aligning a background by:
computing a distance between a pixel at location x on each frame and a pixel at the location x on the aligned background for the location x in the two-frame difference image, and
comparing the distance for each location x to the adaptive threshold to generate foreground pixels used in the segmenting of the motion history image.

US Pat. No. 10,321,187

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON SINGLE-MATCH

Gracenote, Inc., Emeryvi...

1. A non-transitory computer readable medium having stored thereon instructions executable by a processing unit to carry out operations comprising:determining that a digital fingerprint of a first segment of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
after determining that the digital fingerprint of the first segment matches the multiple reference fingerprints each corresponding with a different respective channel, performing disambiguation based at least in part on an earlier determination that a digital fingerprint of a second segment of the media content being rendered by the media presentation device matches a reference fingerprint corresponding with just a single channel, the disambiguation establishing that the media presentation device is receiving the media content on the single channel, wherein the first and second segments of the media content being rendered by the media presentation device are first and second respective time segments of the media content being rendered by the media presentation device, wherein the second time segment is before the first time segment, and
taking action based on the establishing that the media presentation device is receiving the media content on the single channel.

US Pat. No. 11,055,346

TAGGING AN IMAGE WITH AUDIO-RELATED METADATA

GRACENOTE, INC., Emeryvi...


1. A computing device, comprising:a camera;
a microphone;
one or more processors; and
a non-transitory, computer-readable medium storing instructions that, when executed by the one or more processors, cause the computing device to perform a set of acts comprising:
detecting one or more predetermined phrases indicating a request to capture an image;
in response to detecting the one or more predetermined phrases indicating a request to capture an image, using the microphone of the computing device to capture audio content
receiving a request to use the camera of the computing device;
in response to receiving the request, using the camera of the computing device to capture an image;
generating audio fingerprint data of the captured audio content;
storing the audio fingerprint data with the captured image on the computing device;
receiving a user input representing a selection of the captured image on the computing device;
in response to receiving the user input, using the stored audio fingerprint data to identify reference audio content that has at least a threshold extent of similarity with the captured audio content; and
outputting an indication of the identified reference audio content while displaying the captured image.

US Pat. No. 10,764,358

RETRIEVAL AND PLAYOUT OF MEDIA CONTENT

Gracenote Digital Venture...

1. A method comprising:accessing, by a client device, a playlist defining a sequence of media content items including a first media content item and a second media content item;
using the accessed playlist, requesting and receiving, by the client device, from one or more server devices, first data representing the media content items of the accessed playlist, and using the received first data to playout the media content items; and
while playing out at least one of the media content items represented by the received first data:
(a) determining, by the client device, a status of an event, and using, by the client device, the determined status of the event as a basis to determine a condition;
(b) making, by the client device, a determination that the determined condition has been satisfied; and
(c) responsive to making the determination that the determined condition has been satisfied, (i) requesting and receiving, by the client device, from the one or more server devices, second data representing a media content item that is related to the event and that is not specified in the accessed playlist, and (ii) using, by the client device, the retrieved second data to playout the media content item related to the event in between playing out the first and second media content items.

US Pat. No. 10,679,672

SYNTHESIZING A PRESENTATION FROM MULTIPLE MEDIA CLIPS

Gracenote, Inc., Emeryvi...

1. A system comprising:a processor-implemented media ingestion module configured to access a plurality of media clips including a first video clip with a first audio waveform and a second video clip with a second audio waveform;
a media analysis module configured to:
match a first acoustic fingerprint of at least a part of the first audio waveform of the first video clip with a second acoustic fingerprint of at least a part of the second audio waveform of the second video clip;
match a first event location of at least a part of the first video clip with a second event location of at least a part of the second video clip; and
determine an audio overlap of the first video clip with the second video clip based at least in part on the match of the first acoustic fingerprint of at least the part of the first audio waveform with the second acoustic fingerprint of at least the part of the second audio waveform and the match of the first event location of at least the part of the first video clip with the second event location of at least the part of the second video clip; and
a content creation module configured to:
merge the first video clip and the second video clip into a group of overlapping video clips based on the audio overlap of the first video clip with the second video clip; and
generate a presentation that includes the group formed by merging the first video clip and the second video clip based on the audio overlap to generate the presentation.

US Pat. No. 10,614,582

LOGO RECOGNITION IN IMAGES AND VIDEOS

Gracenote, Inc., Emeryvi...

1. A method to detect a specific brand in images and video streams, comprising:accepting luminance images at a scale in an x direction Sx and a different scale in a y direction Sy in a neural network; and
training the neural network with a set of training images for detected features associated with a specific brand.

US Pat. No. 10,595,097

INSERTING ADVERTISEMENTS INTO VIDEO CONTENT

GRACENOTE, INC., Emeryvi...

1. A method, comprising:determining one or more fingerprints of video content currently playing at a client device;
by a processor, identifying one or more frames of the video content to be replaced with supplemental video content based on the determined one or more fingerprints, wherein the one or more frames of video content are at least one frame fewer than the supplemental video content; and
replacing the identified one or more frames of the video content and at least one frame adjacent to the identified one or more frames with the supplemental video content without shifting the at least one frame adjacent to the identified one or more frames.

US Pat. No. 10,579,671

GENERATING AND DISTRIBUTING A REPLACEMENT PLAYLIST

Gracenote, Inc., Emeryvi...

1. A method comprising:receiving, by a client device, an instruction specifying that a timer has been activated with a target duration, wherein the client device is playing out a current audio file of a first plurality of audio files, wherein the first plurality of audio files has a first specified order, and wherein playout of the first plurality of audio files has a first duration;
transmitting, by the client device and to a server device, an indication that the timer has been activated with the target duration;
receiving, by the client device and from the server device, a second playlist containing references to a second plurality of audio files in a second specified order, wherein playout of the second plurality of audio files has a second duration that is: less than the first duration and within a threshold duration of the target duration; and
causing, by the client device, an audio player to, after playout of the current audio file completes, stop playing out audio files from the first plurality of audio files and retrieve and play out the second plurality of audio files in the second specified order.

US Pat. No. 10,757,159

RETRIEVAL AND PLAYOUT OF MEDIA CONTENT

GRACENOTE DIGITAL VENTURE...

1. A method comprising:accessing a playlist defining a sequence of media content items including a first media content item and a second media content item;
retrieving, from one or more server devices, first data representing the media content items of the accessed playlist, and using the retrieved first data to play out, via a client device, the media content items in accordance with the sequence defined by the playlist;
accessing second data representing user activity related to the client device, and using the accessed second data as a basis to select a media content item from a plurality of media content items, wherein the user activity comprises user input indicating whether a user has used a particular feature provided by a software application of the client device, wherein the particular feature is a feature other than a request to play out media content, and wherein the selected media content item includes content that provides instructions on how to use the particular feature; and
retrieving third data representing the selected media content item, and using the retrieved third data to play out, via the client device, the selected media content item in between playing out the first and second media content items.

US Pat. No. 10,701,410

METHOD AND SYSTEM FOR REMOTELY CONTROLLING CONSUMER ELECTRONIC DEVICE

Gracenote, Inc., Emeryvi...

1. A method for use in connection with a client device that is configured to present media content sequences, the method comprising:receiving, by the client device, one or more reference fingerprints, wherein the one or more reference fingerprints is associated with channel data for a particular channel;
receiving, by the client device, one or more media content sequences;
generating, by the client device, one or more fingerprints of the received one or more media content sequences;
detecting a match between (i) the received one or more reference fingerprints and (ii) the generated one or more fingerprints; and
responsive to detecting the match, sending, by the client device, to one or more server devices, a message that comprises the channel data associated with the particular channel, wherein the particular channel is different from a former channel that the client device previously determined that it was tuned to based on one or more additional reference fingerprints received by the client device.

US Pat. No. 10,531,150

METHOD AND SYSTEM FOR DETECTING AND RESPONDING TO CHANGING OF MEDIA CHANNEL

Gracenote, Inc., Emeryvi...

1. A method of detecting and responding to a change of channels at a media presentation device, the method comprising:while the media presentation device is presenting media content that was determined to be of a first channel, detecting a threshold mismatch between (i) a query fingerprint stream representing the media content being presented by the media presentation device and (ii) a reference fingerprint stream representing media content of the first channel, wherein the media content being presented by the media presentation device was determined to be of the first channel by comparing the query fingerprint stream with the reference fingerprint stream using a first fingerprint comparison rate and, based on the comparing, detecting a match between the query fingerprint stream and the reference fingerprint stream, wherein detecting the threshold mismatch is done by comparing the query fingerprint stream with the reference fingerprint stream using a second fingerprint comparison rate greater than the first fingerprint comparison rate, and wherein the threshold mismatch indicates a likelihood that the media presentation device has transitioned from presenting the first channel to presenting a second channel;
responsive to detecting the threshold mismatch, changing an overlay on the presented media content, wherein changing the overlay involves removing superimposed content associated with the first channel.

US Pat. No. 10,524,000

MEDIA CHANNEL IDENTIFICATION AND ACTION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON MATCHING WITH DIFFERENTIAL REFERENCE-FINGERPRINT FEATURE

Gracenote, Inc., Emeryvi...

1. A media presentation device comprising:a media input interface through which to receive video content to be presented by the media presentation device, wherein the video content includes video frames having video frame regions, the video frame regions comprising a center, an edge, and a corner;
a media presentation interface for presenting the received video content;
a network communication interface;
a processing unit;
non-transitory data storage; and
program instructions stored in the non-transitory data storage and executable by the processing unit to carry out operations comprising:
generating a first query fingerprint of the video content based on analysis of the video content, and outputting the generated first query fingerprint for transmission through the network communication interface to a server,
receiving from the server, after outputting the first query fingerprint, a request for a second query fingerprint specifically focused on an identified video frame region of the video frame regions of the video content, wherein the identified video frame region defines a difference between multiple channels that each have a reference fingerprint matching the first query fingerprint,
outputting, for transmission through the network communication interface to the server, the requested second query fingerprint specifically focused on the identified video frame region of the video content, and
causing the media presentation device to present, in conjunction with the video content that the media presentation device is presenting, supplemental channel-specific content associated with one of the multiple channels, the one channel being identified from among the multiple channels based on a determination that the second query fingerprint matches a reference fingerprint of just the one channel of the multiple channels.

US Pat. No. 10,451,952

SYSTEMS AND METHODS TO TRANSFORM EVENTS AND/OR MOOD ASSOCIATED WITH PLAYING MEDIA INTO LIGHTING EFFECTS

GRACENOTE, INC., Emeryvi...

1. An apparatus comprising:a content identifier to identify a first event occurring during presentation of media content at a first time; and
a content driven analyzer to:
determine a first lighting effect to be produced by a light-producing device based on the first event; and
instruct the light-producing device to produce the first lighting effect based on the first event during presentation of the media content, the content identifier to identify a second media event occurring during presentation of the media content at a second time after the first time, the content driven analyzer to instruct the light-producing device to one of maintain the first lighting effect based on the second event or produce a second lighting effect based on the second event during presentation of the media content.

US Pat. No. 10,412,448

MEDIA CHANNEL IDENTIFICATION WITH MULTI-MATCH DETECTION AND DISAMBIGUATION BASED ON LOCATION

Gracenote, Inc., Emeryvi...

1. A non-transitory computer readable medium having stored thereon instructions executable by a processing unit to cause a computing system to carry out operations comprising:receiving from a media presentation device a communication providing a digital fingerprint of media content being rendered by the media presentation device;
receiving from the media presentation device an indication of location of the media presentation device;
determining that the digital fingerprint of media content being rendered by a media presentation device matches multiple reference fingerprints each corresponding with a different respective channel;
responsive to determining that the digital fingerprint matches the multiple reference fingerprints each corresponding with a different respective channel, performing disambiguation based at least in part on the received indication of location of the media presentation device, to determine which channel that the media presentation device is receiving the media content on, wherein performing the disambiguation comprises, based at least in part on a match between the location of the media presentation device and a location associated with a particular one of the multiple reference fingerprints, determining that the respective channel with which the particular reference fingerprint corresponds is the channel that the media presentation device is receiving media content on; and
taking action based on the determined channel.