Caterpillar Inc. tube pricing prediction with random forests
This research is based on Kaggle1 competition for Caterpillar Inc. Caterpillar Inc. sells a variety of construction and mining equipment. Each machine relies on a complex set of tubes. Tubes can vary across a number of dimensions, material, bends, length and other parameters. Caterpillar Inc. relies on a variety of suppliers to manufacture these tube assemblies, each having their own unique pricing model. The challenge is to predict supplier's tube price based on tube parameters. Caterpillar Inc. reviles novel field to apply machine learning technique. In this paper I reveal a good approach to solve this task. This solution ranked in a top 10% among more than 1300 contestants. Ranking was based on RMSLE and the solution achieved 0.218223. I found useful to ensemble various random forests  predictions with xgboost  library.