I was working on bug#13381 and I realized that I need some more stats background. Or at least, some application of the stats classes I had 6-7 years ago would be helpful.
We have this problem were we need to categorize projects into three groups: new/never-started, active, and abandoned/inactive. Active/inactive is fairly easy - figuring out new from active is more difficult, however.
Thanks to cvs2mysql I have some information on how frequently a project has committed files to their module. The problem is how to use this information to figure out that a project is still "new". As far as I can tell right now, a project is still new if:- They've never committed anything
- They've only committed a few times
- They've only committed things for a short period of time over the entire commit history (say, 7 days)
- Time of each transaction
- Number of commits
- Number of files per commit