Hello, Pangeo community!

I am a graduate student in Physical Oceanography hoping to contribute to the development of xarray and Pangeo itself one day

I am only beginning to learn these tools though. Sorry if it is going to sound like a very basic question.

Somehow it is not easy to find an open source notebooks with multiple regression in Earth Sciences.

Iād be curious how people do it and if there is a standard way to do it.

From actual talking to people in my institution I gathered that about half of the people write their own model from scratch (using matrix formulation) and half is relying on existing solutions. Too many people use Matlab.

I want to perform a multiple linear regression analysis using principal components I extracted from a bunch of tide gauges along the East Coast and satellite altimetry across a wider domain in the Atlantic.

Would it be a standard thing to just use `statsmodels.regression.linear_model.OLS`

?

Does it work well with xarray dataset? How do I make it work for the whole field (if I have to establish regression coefficient in each grid cell of altimetry data regressed onto PCs).

Thank you!

Yuta