[CS241] data sharing

Avram Levi avramlevi at lems.brown.edu
Mon Oct 29 21:53:13 EDT 2007


skip,

my mom is in town visiting me (after a year and a half) so i was away 
from the computer all day long, just got your message... Most of our 
path's have vp0 as the top node which prof. charniak said was 
practically trash so I don't think we really have to go all the way and 
fix these discrepancies.

on the other hand i suggest that the first person that gets something 
done (find the counts of the vp-vp-vp's or gets the sentences) to post 
his results on a google spreadsheet in a reasonable format so that all 
of us can make additions or whatever...

see you guys soon
Avram

Skip wrote:

> An amendment
>
> Where Avram and I have the same path, we have the same tense-counts 
> for all reasonable (tree.vals[8] == [01][01][1-4]) tenses.
> When I include unreasonable tenses, we get different counts.
> Since I counted paths with unreasonable tenses when I took paths with 
> 10,000 instances, we got some different paths.
>
> The following is a list of #path-instances by path-type for (my, 
> avram's) top 12 path-types.
>  (128041, 115487),
>  (125588, 88564),
>  (93718, 72037),
>  (39514, 38546),
>  (35535, 31282),
>  (32972, 25657),
>  (25879, 23351),
>  (21643, 18062),
>  (19329, 17029),
>  (17293, 10623),
>  (11499, 4074),
>  (10465, 521)
>
> These are the corresponding path-pairs (in my notation, ^X^ => X is 
> the root):
>                           ^VP^ SBAR S VP             
> VP_SBAR_S_VP_topNode_VP0.txt
>                                ^VP^ S VP                  
> VP_S_VP_topNode_VP0.txt
>                                  ^VP^ VP                    
> VP_VP_topNode_VP0.txt
>                            VP S ^S^ S VP               
> VP_S_S_S_VP_topNode_S2.txt
>                        ^VP^ NP SBAR S VP          
> VP_NP_SBAR_S_VP_topNode_VP0.txt
>                              VP ^S^ S VP            
> VP_S_SBAR_S_VP_topNode_S3.txt
>                         VP ^S^ SBAR S VP                 
> VP_S_S_VP_topNode_S2.txt
>                     ^VP^ PP NP SBAR S VP       
> VP_PP_NP_SBAR_S_VP_topNode_VP0.txt
>                             ^VP^ PP S VP         
> VP_S_SBAR_NP_S_VP_topNode_S4.txt
>                      VP ^S^ NP SBAR S VP           
> VP_S_SINV_VP_topNode_SINV2.txt
>                           VP ^SINV^ S VP                 
> VP_NP_VP_topNode_VP0.txt
>                               ^VP^ NP VP               
> VP_PP_S_VP_topNode_VP0.txt
>
> Avram:
> You said you'd not counted a some paths and attached a new file of 
> P(vp1|vp0,path)/P(vp1|path)s, but not a new file of counts.
> If the counts you sent out aren't current, can you attach your new 
> counts in the same format?
>
> On 10/29/07, *Skip* < dave.hirshberg at gmail.com 
> <mailto:dave.hirshberg at gmail.com>> wrote:
>
>     I get the same paths you have, but different tense-counts.
>     4 or 5 of our paths differ from Avram's, but where Avram and I
>     have the same path (our top 6 or so are the same), we have the
>     same tense-counts.
>
>     ?
>
>     On 10/28/07, *Tim St. Clair* < tstclair at cs.brown.edu
>     <mailto:tstclair at cs.brown.edu>> wrote:
>
>         Here is my initial set of data.  It looks like it is different
>         from juris, but I haven't checked it out that closely yet.
>
>         The listserv would not let me attach it, so here it is in
>         google document format.  Let me know if you want a copy of the
>         csv file.
>
>         http://spreadsheets.google.com/ccc?key=p3coYwZqOPPzwP5bOiMBrBQ&hl=en
>         <http://spreadsheets.google.com/ccc?key=p3coYwZqOPPzwP5bOiMBrBQ&hl=en>
>
>         -- 
>         Tim St. Clair
>
>         (617) 460 - 6497
>         _______________________________________________
>         CS241 mailing list
>         CS241 at list.cs.brown.edu <mailto:CS241 at list.cs.brown.edu>
>         http://list.cs.brown.edu/mailman/listinfo/cs241
>
>
>
>------------------------------------------------------------------------
>
>_______________________________________________
>CS241 mailing list
>CS241 at list.cs.brown.edu
>http://list.cs.brown.edu/mailman/listinfo/cs241
>  
>



More information about the CS241 mailing list