數據科學之路 的多語言選擇

image.png

python 也正是因為 scipy numpy pandas sklearn matplot tf 四個包在數據科學中非常顯眼,不過由于GIL的影響,python 和java 為人詬病的就是【慢】,所以這些缺點是我們一直選擇去尋找更快的替代方案 ,集中在 c++ golang julia 三種語言 swift 和rust 也稍微找找,java 太沉重了,scala 有spark 支撐 。clojure 語法有點反人類,R 和python 速度大同小異


image.png

jupyter 支持的后端語言
https://github.com/jupyter/jupyter/wiki/Jupyter-kernels

image.png

c++版
pandas https://github.com/hosseinmoein/DataFrame
numpy https://github.com/QuantStack/xtensor
https://github.com/ndarray/Boost.NumPy

https://github.com/AtsushiSakai/numpycpp
https://github.com/rogersce/cnpy

sklearn https://github.com/davisking/dlib

image.png

data science tookit https://github.com/meta-toolkit/meta/
https://meta-toolkit.org/
matplot https://github.com/lava/matplotlib-cpp

image.png

golang版
data science https://github.com/cpmech/gosl
matplot https://github.com/zieckey/gochart
scipy https://github.com/montanaflynn/stats
sklearn https://github.com/sjwhitworth/golearn
https://github.com/pa-m/sklearn
https://github.com/qingtiandalaoye/GoDataframe
https://github.com/piquette/finance-go
https://github.com/orcaman/financial
numpy
https://github.com/ledao/arrgo
https://piquette.io/projects/finance-go/
tf.go
https://github.com/google/grumpy

image.png

julia
https://github.com/JuliaPy/Pandas.jl
http://www.lxweimin.com/p/87977f582c27?utm_source=oschina-app
https://github.com/cstjean/ScikitLearn.jl

matplot https://github.com/JuliaPlots/Plots.jl
http://docs.juliaplots.org/latest/
tf.jl
https://github.com/IntelLabs/ParallelAccelerator.jl
https://blog.csdn.net/u014636245/article/details/82216716
https://blog.csdn.net/a_step_further/article/details/79662088
有一個模型轉換器
https://github.com/nok/sklearn-porter

https://github.com/7125messi

image.png

swift
numpy https://github.com/sonsongithub/numsw
https://github.com/nifty-swift/Nifty
plot https://github.com/i-schuetz/SwiftCharts

https://github.com/apple/coremltools
coreml https://developer.apple.com/documentation/coreml

image.png

rust
https://github.com/rust-numpy/rust-numpy

tf https://github.com/tensorflow/rust
sklearn https://github.com/maciejkula/rustlearn
pandas https://github.com/weld-project/weld
matplot https://github.com/SiegeLord/RustGnuplot
https://github.com/ubnt-intrepid/rustplotlib
https://github.com/milliams/plotlib
https://github.com/coder543/dataplotlib

image.png

R
pandas dplyr
https://www.dataquest.io/blog/python-vs-r/
http://www.10tiao.com/html/403/201806/2650629741/1.html
http://mathesaurus.sourceforge.net/r-numpy.html
https://github.com/topepo/caret

1. caret

image.png

caret has been used by me with success: http://caret.r-forge.r-project.org/

2. MLR

There is also the MLR package: https://cran.r-project.org/web/packages/mlr/index.html

From the site:


image.png

H2O

?著作權歸作者所有,轉載或內容合作請聯系作者
平臺聲明:文章內容(如有圖片或視頻亦包括在內)由作者上傳并發布,文章內容僅代表作者本人觀點,簡書系信息發布平臺,僅提供信息存儲服務。

推薦閱讀更多精彩內容

  • 工欲善其事,必先利其器。總結一下,方便多了。R語言還是很牛逼的,可以干很多事情。有一把順手的刀還是很重要的。 0....
    Liam_ml閱讀 4,697評論 1 60
  • # Awesome Python [![Awesome](https://cdn.rawgit.com/sindr...
    emily_007閱讀 2,227評論 0 3
  • 原文鏈接 《Python數據分析》(Python for Data Analysis, 2nd Edition)第...
    李紹俊閱讀 8,584評論 0 5
  • 最難受的就是不了了之。 但是也能理解, 誰都需要生活, 誰都需要找個懂你的人, 我只希望, 你覺得他并不壞的話, ...
    李康有為閱讀 108評論 0 0
  • 在這個世界相遇- - 陳奕迅 星月相掩 于大海上 微風搖曳 細雨也彷徨 流霞飛舞 群青深處 你我曾 相遇的地方 你...
    窗邊的藍豆豆閱讀 350評論 0 1