I know that vw can handle very raw data(e.g. raw text) but for instance should one consider scaling numerical features before feeding the data to vw? Consider the following line:
1 |n age: 80.0 height: 180.0 |c male london |d the:1 cat:2 went:3 out:4
Assuming that typical age ranges from 1 to 100 and height(in centimeters) may range from 140 to 220, is it better to transform/scale the
height so they share a common range? I think many algorithms may need this kinda of preprocessing on their input data, for example Linear Regression.