August 30 to 01 2016, Santa Clara, USA.


Speaker "Songxiao Zhang" Details

Name :
songxiao zhang
Company :
Title :
Data Engineer
Topic :

pyh3: Scalable and High Performance Graph Visualization in 3D Hyperbolic Space

Abstract :

As the big social networks such as Facebook, Twitter, Instagram etc. are becoming super popular, direct hits for content generating websites such as New York Times and BuzzFeed is taking a smaller and smaller portion of content views. People are more getting used to see these content in social networks in posts shared by friends. According to a news in Business Insider 2013, only 23% of BuzzFeed views are from directly. To analyze how the content is viewed on Internet, we need to trace how a post is shared across network and investigate how the viral content spread. With the spread process into a spanning tree layout, an analytic 3D visualization can effectively deliver such results. The challenge of getting a minimum spanning tree layout from a large dataset comprising the relational information is computationally expensive. The tree layout contains analytic information or results of how information flows through the tree in network, and the layout shape indicates the pattern of information flow. When visualizing a larger tree, a trade-off between overall structure and node details is getting harder to manage. A fish-eye effect visualization is preferred to give a clear overall spanning tree structure and at the meantime, clearly present node details that are currently in focus. H3 (3 dimensional visualization in hyperbolic space) is a good candidate developed by Standford Imagery Laboratory providing a scalable, high performance and interactive graph visualization in 3D hyperbolic space. We found this algorithm is good at tracking and presenting massive amounts of posts sharing across different social networks over time. pyh3 is capable of laying out trees on the order of millions of nodes in a matter of minutes.

Profile :
Songxiao graduated from MS. in Computer Science in Courant Institute of Mathematical Sciences of New York University. He interned in Buzzfeed's Optimization Squad and developed a Python version of H3 for visualizing data from the Pound project at Buzzfeed. This package, capable of laying out trees on the order of millions of nodes in a matter of minutes. He is an incoming Software Engineer in BlackRock, Inc.

Get latest updates of 4th Annual Global Big Data Conference
sent to your inbox.

Weekly insight from industry insiders.
Plus exclusive content and offers.