Commit graph

20 commits

Author SHA1 Message Date
e92abdab13 Add functionality to restart tree training where previously left off. 2018-08-09 16:34:10 -07:00
55eab76610 Simplied code. 2018-08-08 15:57:28 -07:00
9d9dc9ef8d Optimize tree training so that the best split is not applied twice 2018-08-08 11:34:02 -07:00
74151b94db Add alternative way where functions are computed only at final step. 2018-08-07 15:49:55 -07:00
bf56dfb59d Add ability to compute different error rates. 2018-08-07 10:52:52 -07:00
e1caef6d56 Implement naive mortality error measure 2018-07-25 15:29:09 -07:00
d4853f5232 Change how trees are saved so that they are compressed. 2018-07-18 15:29:55 -07:00
05f9122b58 Add capability to load trees back into memory. 2018-07-17 13:54:59 -07:00
fffdfe85bf Finish competing risk implementation. Fix a bug in tree training
algorithm.
2018-07-16 16:58:11 -07:00
462b0d9c35 Implement Response & GroupDifferentiators for CompetingRisk problems.
Also adjusted how settings are done to allow for specifying
differentiators & responses that may require arguments.

Note that CompetingRisk code is untested at this point.
2018-07-10 14:43:51 -07:00
6b62ad95c3 Add support for loading datasets by CSV files. 2018-07-06 13:21:56 -07:00
2cdcbe6cbf Refactor different classes into subpackages. 2018-07-05 12:59:29 -07:00
e96a578ac9 Refactored code to allow for a class of covariates to determine which
SplitRules are tested.

Most of the refactoring involved the creation of a Covariate class (one
instance per column); with SplitRule and Value being folded in as inner
classes.
2018-07-03 17:00:02 -07:00
e7af65e8fd Fixed a bug where Splits could be generated that had an empty daughter
node
2018-07-03 15:15:09 -07:00
254727e594 Add support for saving trees as forest is being trained.
Support for loading the trees back is not yet written.
2018-07-03 12:31:08 -07:00
df35a2007a Remove inefficient debug code previously missed. 2018-07-03 11:20:15 -07:00
5f280d09a1 Add parallel support & fix fatal bug in TreeTrainer#findBestSplitRule. 2018-07-02 23:16:20 -07:00
df7835869a Add functionality to train a random
forest in serial.
2018-07-02 17:58:53 -07:00
6192643e12 Change ResponseCombiner to be a Collector that's compatible
with Streams.
2018-07-02 12:27:18 -07:00
3c9c78741f Basic functinality to train a single regression tree is
implemented.
2018-07-01 22:22:12 -07:00