Skip to content
dmarcos edited this page Sep 28, 2011 · 1 revision

Statistical Analysis Library

Requirements Draft

Required

  • sort: using a given column (either generating a new column or reorganizing based on a column)
  • min: of a column
  • max: of a column
  • sum: of a column
  • mean: of a column
  • median: of a column (comes naturally out of the sorting)
  • stddev: of a column

Useful

  • sin,cos, tan, asin, acos, atan
  • cumulative sum
  • quartiles
  • log, power
  • sqrt etc

Advanced Functionality

  • columnar math: select 2 or more columns and apply a mathematical function and generate a new column
  • select: apply a filter to the data based on one or more files and generate a subset
  • regression: fitting functions to data

Clone this wiki locally