Here’s an illustration I pulled together using Python, NetworkX, and D3.js to illustrate the interconnections between abstracts that will be presented at the VisionĀ Sciences Society 2014 annual meeting in approximately 2 weeks. Orange dots represent abstracts, Light Blue dots represent authors with at least one first authorship, and Dark Blue dots represent other authors (second through last).
data:image/s3,"s3://crabby-images/c99fa/c99fac2f4773b673bf7d56c1e5ce3521d4dc0f0f" alt="VSS DNA v1"
As you can see, there are large numbers of abstracts that have few shared authors. Ā Those abstracts that share authors oftenĀ join together to create “chains” of students, advisors, and colleagues.
This is a first version, hastily pulled together, so there are a few problems. Ā The nodes are assigned to authors by name, which can be a problem for authorsĀ sharing the same name (which createsĀ more connections than appropriate for a given node) or who Ā have inconsistent reporting of their nameĀ (for example, omitting the middle initial or alternate spelling, which can create another erroneous node). I amĀ thinking of addressing the duplicate node issueĀ by using a string similarity metric (e.g., Levenshtein distance) to find stringsĀ that containĀ similar names to combine the connections, but this could be an issue if the names are truly different people. Alternatively, I could incorporateĀ the authors’ affiliations, but this carries similar issues (e.g., I report my affiliation as “University of Giessen” while colleagues report it as “Justus-Liebig-UniversitƤt GieĆen”).
AlthoughĀ there are lingering issues, it is still an interesting illustration of the connections between the different abstracts being presented at VSS 2014.
Here’s the code on GitHub:Ā visvssrelationships