Commit graph

23 commits

Author SHA1 Message Date
fffdfe85bf Finish competing risk implementation. Fix a bug in tree training
algorithm.
2018-07-16 16:58:11 -07:00
462b0d9c35 Implement Response & GroupDifferentiators for CompetingRisk problems.
Also adjusted how settings are done to allow for specifying
differentiators & responses that may require arguments.

Note that CompetingRisk code is untested at this point.
2018-07-10 14:43:51 -07:00
4bbb0e0948 Fix a bug whereby FactorCovariate fails when "NA" is provided.
Also improved testing around this.
2018-07-06 13:33:58 -07:00
6b62ad95c3 Add support for loading datasets by CSV files. 2018-07-06 13:21:56 -07:00
fe9ff37dcf Upgraded Settings class to allow for covariates to be built from
provided values.
2018-07-05 19:04:26 -07:00
b010e79269 Add basic Settings class with persistence. 2018-07-05 13:59:52 -07:00
2cdcbe6cbf Refactor different classes into subpackages. 2018-07-05 12:59:29 -07:00
662a6cf761 Add OTFI imputation when training forest.
No tests have been written yet so this is still WIP.
2018-07-05 12:05:07 -07:00
Joel Therrien
c048a285a1 Merge branch '01-factors' of joel/RandomSurvivalForests into master 2018-07-04 20:27:42 +00:00
3b8952e13c Added some tests for FactorCovariate. Moved workshop over to test
codebase.
2018-07-04 13:24:34 -07:00
c7298f7da6 Fix incorrect use of non-concurrent Random object in NumericCovariate. 2018-07-04 12:18:27 -07:00
e0cfed632f Add FactorCovariate; testing required. 2018-07-04 12:18:06 -07:00
2259528c22 Small modificaton of NumericCovariate; child classes now gurantee they
return NumericCovariate when getParent() is called.
2018-07-04 10:54:46 -07:00
38e70dd3a1 Add BooleanCovariate 2018-07-04 10:54:07 -07:00
e96a578ac9 Refactored code to allow for a class of covariates to determine which
SplitRules are tested.

Most of the refactoring involved the creation of a Covariate class (one
instance per column); with SplitRule and Value being folded in as inner
classes.
2018-07-03 17:00:02 -07:00
e7af65e8fd Fixed a bug where Splits could be generated that had an empty daughter
node
2018-07-03 15:15:09 -07:00
254727e594 Add support for saving trees as forest is being trained.
Support for loading the trees back is not yet written.
2018-07-03 12:31:08 -07:00
df35a2007a Remove inefficient debug code previously missed. 2018-07-03 11:20:15 -07:00
5f280d09a1 Add parallel support & fix fatal bug in TreeTrainer#findBestSplitRule. 2018-07-02 23:16:20 -07:00
df7835869a Add functionality to train a random
forest in serial.
2018-07-02 17:58:53 -07:00
6192643e12 Change ResponseCombiner to be a Collector that's compatible
with Streams.
2018-07-02 12:27:18 -07:00
3c9c78741f Basic functinality to train a single regression tree is
implemented.
2018-07-01 22:22:12 -07:00
7a467207a4 Initial commit; some base classes have been defined
but no logic exists yet.
2018-06-29 12:04:59 -07:00