improve sklearn pipeline