Skip to content

bigdong89/xgboostExtension

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

47 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

xgboostExtension

xgboost Extension for Easy Ranking & Leaf Index Feature

Pypi package: XGBoost-Ranking Related xgboost issue: Add Python Interface: XGBRanker and XGBFeature#2859

As we know, Xgboost offers interfaces to support Ranking and get TreeNode Feature. However, the example is not clear enough and many people leave their questions on StackOverflow about how to rank and get lead index as features.

Now xgboostExtension is designed to make it easy with sklearn-style interfaces. Also it can work with sklearn cross-validation

Packages:

  • XGBRanker
    • Give rank scores for each sample in assigned groups.
    • The scores are valid for ranking only in their own groups.
  • XGBFeature
    • Give the index of leaf in trees for each sample.
    • XGBFeature is very useful during the CTR procedure of GBDT+LR. In addition, it's better to take the index of leaf as features but not the predicted value of leaf. Firstly, the predicted values of leaves are as discrete as their index. Secondly, the predicted values of leaves like [0.686, 0.343, 0.279, ... ] are less discriminant than their index like [10, 7, 12, ...]. So we take the index as features.

Version

  • The version of XGBoostExtension always follows the version of compatible xgboost.
  • For example:
    • XGBoostExtension-0.6 can always work with XGBoost-0.6
    • XGBoostExtension-0.7 can always work with XGBoost-0.7
    • But xgboostExtension-0.6 may not work with XGBoost-0.7