sandbox.regression.try_ols_anova.form2design()

statsmodels.sandbox.regression.try_ols_anova.form2design

statsmodels.sandbox.regression.try_ols_anova.form2design(ss, data)

Convert a string formula to a data dictionary.

Parameters:

ss : string
  formula made up of space-separated terms:
  • I : add a constant
  • varname : for simple variable names, the data is used as is
  • F:varname : create dummy variables for the factor varname
  • P:varname1*varname2 : create product dummy variables for the two variables
  • G:varname1*varname2 : create the product between a factor and a continuous variable
data : dict or structured array
  data set; variables are accessed by name, as in a dictionary
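As a rough illustration of what "dummy variables" (F:) and "product between factor and continuous variable" (G:) mean, here is a minimal pure-Python sketch. The helpers `dummy_columns` and `group_continuous` are hypothetical and are not part of statsmodels. The sketch assumes one indicator column per level with no level dropped; the actual function may encode factors differently (for example by omitting a reference level), which this page does not state.

```python
def dummy_columns(values):
    """Return the sorted levels and one 0/1 indicator row per observation."""
    levels = sorted(set(values))
    rows = [[1 if v == level else 0 for level in levels] for v in values]
    return levels, rows


def group_continuous(labels, x):
    """Multiply each indicator column of `labels` by the continuous variable `x`."""
    levels, dummies = dummy_columns(labels)
    rows = [[d * xi for d in row] for row, xi in zip(dummies, x)]
    return levels, rows


# Hypothetical data: a factor b and a continuous variable f
b = ['u', 'v', 'u', 'w']
f = [1.5, 2.0, 0.5, 3.0]

levels, b_dummies = dummy_columns(b)   # illustrates an F:b style term
_, bf = group_continuous(b, f)         # illustrates a G:b*f style term
```

Each row of `bf` is nonzero only in the column of that observation's level, so a regression on these columns fits a separate slope for `f` within each group.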
Returns:

vars : dictionary

dictionary of variables, with factors converted to dummy variables

names : list

list of names, sorted according to the input formula; product (P:) and grouped continuous (G:) variables are named by joining the individual variable names
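The naming rule can be sketched without any data. The helper `term_names` below is hypothetical and not part of statsmodels; it only mimics the rule as it can be read off this page (a constant is named `const`, prefixed terms drop the prefix and join the variable names), so treat it as an approximation.

```python
def term_names(ss):
    """Return the output name for each space-separated term of a formula."""
    names = []
    for item in ss.split():
        if item == 'I':
            # the constant term is named 'const' in the example on this page
            names.append('const')
        elif item[:2] in ('F:', 'P:', 'G:'):
            # drop the prefix and join the variable names around '*'
            names.append(''.join(item[2:].split('*')))
        else:
            # plain variable names are kept unchanged
            names.append(item)
    return names


names = term_names('I a F:b P:c*d G:c*f')
print(names)  # ['const', 'a', 'b', 'cd', 'cf']
```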

Notes

With a sorted (ordered) dictionary, the separate list of names would not be necessary.
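To illustrate this note, the sketch below uses `collections.OrderedDict` from the standard library, which remembers insertion order; plain `dict` does the same in Python 3.7 and later. The variable `design` and its placeholder values are assumptions for illustration only and do not reflect what the function actually returns.

```python
from collections import OrderedDict

# Insert the variables in formula order; None stands in for the column data.
design = OrderedDict()
for name in ['const', 'a', 'b', 'cd', 'cf']:
    design[name] = None

# The keys come back in insertion order, so they can serve as the name list.
key_order = list(design)
```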

Examples

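>>> from statsmodels.sandbox.regression.try_ols_anova import form2design
>>> # testdata is assumed to be a dict or structured array with fields a, b, c, d and f
>>> xx, n = form2design('I a F:b P:c*d G:c*f', testdata)
>>> xx.keys()
['a', 'b', 'const', 'cf', 'cd']
>>> n
['const', 'a', 'b', 'cd', 'cf']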

© 2009–2012 Statsmodels Developers
© 2006–2008 Scipy Developers
© 2006 Jonathan E. Taylor
Licensed under the 3-clause BSD License.
http://www.statsmodels.org/stable/generated/statsmodels.sandbox.regression.try_ols_anova.form2design.html
