Davide Alberani | 6 Apr 2011 20:09

Re: IMDb IDs aren't really necessary for names and titles

On Thu, Mar 31, 2011 at 06:13, Michael Grier
<mr.michael.grier@...> wrote:
>
> It does work; you have to have the comma in there... (%2C)
>
> http://www.imdb.com/Name?Gibson%2C%20Mel%20%28I%29
>
> redirects to
>
> http://www.imdb.com/name/nm0000154/

Right; I'll look into integrating this solution instead of the current search
done to convert from titles/names to imdbID, thanks!
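
For reference, a minimal sketch of the lookup described above, using only the
Python standard library. The URL form and the redirect target are the ones
quoted in this thread; the function names (build_name_url, id_from_location,
lookup_name_id) are made up for illustration and are not part of IMDbPY.
Whether the site still answers this URL with a redirect is an assumption.

```python
import re
import urllib.error
import urllib.parse
import urllib.request

# URL prefix as quoted in this thread.
IMDB_NAME_LOOKUP = "http://www.imdb.com/Name?"


def build_name_url(name):
    """Percent-encode the whole name, including the comma (%2C)."""
    return IMDB_NAME_LOOKUP + urllib.parse.quote(name, safe="")


def id_from_location(location):
    """Pull an 'nm' id out of a Location header value, or return None."""
    match = re.search(r"/name/(nm\d+)", location or "")
    return match.group(1) if match else None


class _NoRedirect(urllib.request.HTTPRedirectHandler):
    """Refuse to follow redirects so the Location header can be read."""

    def redirect_request(self, req, fp, code, msg, headers, newurl):
        return None  # urllib then raises HTTPError for the 3xx response


def lookup_name_id(name, timeout=10):
    """Hypothetical helper: one request, id taken from the redirect."""
    opener = urllib.request.build_opener(_NoRedirect)
    try:
        response = opener.open(build_name_url(name), timeout=timeout)
    except urllib.error.HTTPError as err:
        return id_from_location(err.headers.get("Location"))
    response.close()
    return None  # no redirect happened, so no id was found
```

With this, build_name_url("Gibson, Mel (I)") produces the URL quoted above, and
id_from_location() applied to the redirect target yields "nm0000154".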

-- 
Davide Alberani <davide.alberani@...>  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/

Michael Grier | 8 Apr 2011 23:38

Re: IMDb IDs aren't really necessary for names and titles

On 4/6/11, Davide Alberani <davide.alberani@...> wrote:
> On Thu, Mar 31, 2011 at 06:13, Michael Grier <mr.michael.grier@...>
> wrote:
>>
>> It does work; you have to have the comma in there... (%2C)
>>
>> http://www.imdb.com/Name?Gibson%2C%20Mel%20%28I%29
>>
>> redirects to
>>
>> http://www.imdb.com/name/nm0000154/
>
> Right; I'll look into integrating this solution instead of the current
> search
> done to convert from titles/names to imdbID, thanks!

Some notes:
1. If you want to grab the nm ids, you will have to do it before you
canonicalize any names that IMDb did not in the flat files, OR save
the original name when you canonicalize a name.

2. You do not incur a "too many requests" type of penalty (I forget
what the actual message is, but you probably know what I'm talking
about) when you use the method I mentioned earlier to get the id from
the Location header, but I would be wary of doing it too much (like
during the flat files import). Your IP could get banned. It would also
make the import take much longer.

3. It won't find:
    A: Anything with a + (plus symbol) in the name or title.
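
On note 2, one way to stay cautious is to space the requests out. The sketch
below is only an illustration: the Throttle class is hypothetical, and the
interval that would be safe is unknown, so any value passed in is a guess.

```python
import time


class Throttle:
    """Enforce a minimum delay between successive calls to wait()."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval  # seconds between requests (a guess)
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous call, if any

    def wait(self):
        """Block until at least min_interval has passed since the last call."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now
```

Calling wait() before every lookup keeps the request rate bounded, at the cost
of the longer import time already mentioned in note 2.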

Michael Grier | 8 Apr 2011 23:44

Re: IMDb IDs aren't really necessary for names and titles

> 1. If you want to grab the nm ids, you will have to do it before you
> canonicalize any names that IMDb did not in the flat files, OR save
> the original name when you canonicalize a name.

Maybe just add another BOOL field to the db to indicate if the import
script canonicalized the name or not. Then later, if it was
canonicalized, normalize it before trying to send off the url.
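
A rough sketch of that idea, assuming a row that stores the name plus the
extra flag. NameRow, normalize_name and lookup_form are hypothetical names, and
normalize_name is a simplified stand-in for the real normalization (it ignores
suffixes such as "(I)" and any edge cases).

```python
from dataclasses import dataclass


@dataclass
class NameRow:
    name: str            # name as stored in the db
    canonicalized: bool  # True if the import script canonicalized it


def normalize_name(name):
    """Simplified: turn 'Surname, Name' back into 'Name Surname'."""
    surname, sep, given = name.partition(", ")
    return f"{given} {surname}" if sep and given else name


def lookup_form(row):
    """Return the form of the name to use when building the url."""
    return normalize_name(row.name) if row.canonicalized else row.name
```

Rows that were left as they appear in the flat files are sent unchanged; only
rows flagged as canonicalized by the import script are converted back first.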

On 4/8/11, Michael Grier <mr.michael.grier@...> wrote:
> On 4/6/11, Davide Alberani <davide.alberani@...> wrote:
>> On Thu, Mar 31, 2011 at 06:13, Michael Grier <mr.michael.grier@...>
>> wrote:
>>>
>>> It does work; you have to have the comma in there... (%2C)
>>>
>>> http://www.imdb.com/Name?Gibson%2C%20Mel%20%28I%29
>>>
>>> redirects to
>>>
>>> http://www.imdb.com/name/nm0000154/
>>
>> Right; I'll look into integrating this solution instead of the current
>> search
>> done to convert from titles/names to imdbID, thanks!
>
>
> Some notes:
> 1. If you want to grab the nm ids, you will have to do it before you
> canonicalize any names that IMDb did not in the flat files, OR save
> the original name when you canonicalize a name.

Davide Alberani | 10 Apr 2011 17:16

Re: IMDb IDs aren't really necessary for names and titles

On Fri, Apr 8, 2011 at 23:38, Michael Grier
<mr.michael.grier@...> wrote:
>
> 3. It won't find:
>    A: Anything with a + (plus symbol) in the name or title.

Have you tried replacing the plus symbol with '%2B' ?
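
For what it's worth, a literal '+' in a query string is commonly decoded as a
space, which may be why such names are not found. A small sketch with Python's
urllib.parse showing the difference; encode_for_lookup is a hypothetical
helper, and whether the site accepts '%2B' is not confirmed here.

```python
import urllib.parse


def encode_for_lookup(text):
    """Percent-encode everything that is not unreserved, so '+' -> '%2B'."""
    return urllib.parse.quote(text, safe="")


# A raw '+' is read back as a space by form-style decoding,
# while '%2B' survives as a real plus sign.
raw_plus = urllib.parse.unquote_plus("2+2")
encoded_plus = urllib.parse.unquote_plus("2%2B2")
```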

-- 
Davide Alberani <davide.alberani@...>  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/
