NodeXL Data Analysis Task List: Steps for data import, scrub, analysis, and visualization

Oct 6, 2009 at 10:14 PM

Most data analysis tasks with NodeXL will follow a common set of steps:

  • Import
  • Merge duplicate edges
  • Calculate (ALL) network metrics
  • Create clusters
  • Insert sub-graph images
  • Sort edge list in a way that usefully affects the order of layout in the graph display
  • Auto-fill columns (and map data to display attributes): Set shape, color, opacity, size, and label/tooltip
  • Show graph
  • Read workbook
  • Adjust layout
  • Dynamic Filters – selectively hide edges and nodes
  • Layout Again
  • Return to spreadsheet to sort or calculate data
  • Integrate additional edge lists
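
As a rough illustration of what the "merge duplicate edges" and "calculate metrics" steps do to the data, here is a minimal Python sketch. It is not NodeXL code and does not use any NodeXL API; the function names `merge_duplicate_edges` and `degree` and the sample edge list are made up for this example. It assumes an edge list is simply a sequence of (source, target) pairs.

```python
from collections import Counter


def merge_duplicate_edges(edges, directed=True):
    """Collapse repeated (source, target) pairs into one edge with a weight.

    Hypothetical helper for illustration; the weight is the number of times
    the pair appeared in the input edge list.
    """
    counts = Counter()
    for source, target in edges:
        # In an undirected graph, (a, b) and (b, a) are the same edge.
        key = (source, target) if directed else tuple(sorted((source, target)))
        counts[key] += 1
    return [(s, t, weight) for (s, t), weight in counts.items()]


def degree(merged_edges):
    """Count how many distinct edges touch each vertex (a simple metric)."""
    deg = Counter()
    for source, target, _weight in merged_edges:
        deg[source] += 1
        deg[target] += 1
    return dict(deg)


# Made-up sample data: "ann" -> "bob" appears twice.
edges = [("ann", "bob"), ("ann", "bob"), ("bob", "cat"), ("bob", "ann")]
merged = merge_duplicate_edges(edges, directed=True)
for source, target, weight in merged:
    print(source, target, weight)
print(degree(merged))
```

Merging before calculating metrics matters because otherwise a repeated edge would be counted once per row rather than once per relationship.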

Did I miss any steps that you use? There are multiple analytic goals for network data sets that guide what happens next.

What are your goals for network analysis? Please share them here.

-- Marc Smith

Dec 1, 2009 at 11:31 AM

A very nice feature in the auto-filling could be to color the edges according to clusters. That is, if an edge connects two nodes of the same cluster, that edge would take the cluster's color; otherwise, it would be assigned a default color.
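
To make the suggestion concrete, here is a small Python sketch of the rule. It is only an illustration of the idea, not an existing NodeXL feature or API; the function name `color_edges_by_cluster`, the dictionary layout, and the default color "gray" are all assumptions made for this example.

```python
DEFAULT_EDGE_COLOR = "gray"  # assumed default; any color would do


def color_edges_by_cluster(edges, vertex_cluster, cluster_color,
                           default=DEFAULT_EDGE_COLOR):
    """Return a color for each edge based on the clusters of its endpoints.

    edges          -- iterable of (source, target) pairs
    vertex_cluster -- dict mapping vertex name to cluster name
    cluster_color  -- dict mapping cluster name to color
    """
    colors = {}
    for source, target in edges:
        source_cluster = vertex_cluster.get(source)
        target_cluster = vertex_cluster.get(target)
        if source_cluster is not None and source_cluster == target_cluster:
            # Both endpoints are in the same cluster: reuse its color.
            colors[(source, target)] = cluster_color.get(source_cluster, default)
        else:
            # The edge crosses clusters, or an endpoint has no cluster.
            colors[(source, target)] = default
    return colors


# Made-up sample data.
vertex_cluster = {"ann": "c1", "bob": "c1", "cat": "c2"}
cluster_color = {"c1": "red", "c2": "blue"}
edges = [("ann", "bob"), ("bob", "cat")]
print(color_edges_by_cluster(edges, vertex_cluster, cluster_color))
```

With this rule, edges inside a cluster stand out in the cluster's color, while edges that bridge clusters fall back to the default.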