By Ron Cody
Please bear in mind that there's a moment variation of this booklet (at an identical price). as a result, make sure you purchase the second one version and never the unique. here's a hyperlink to the second one version: Cody's info cleansing options utilizing SAS, moment Edition.
I have rewritten each software and each macro within the new version. There also are many extra priceless macros on hand (you can obtain them from the SAS internet site). additionally new is a bankruptcy on SAS integrity constraints and audit trails.
Read Online or Download Cody's Data Cleaning Techniques Using SAS Software PDF
Best enterprise applications books
The 3rd in a seven-volume set, this advisor offers the newest precise reference fabric for the systems in SAS/STAT together with research of variance, regression, express info research, multivariate research, survival research and masses extra.
This ebook is really normal of others within the so-called SAP Cookbook sequence. Overpriced relative to the quantity of data you get. The questions and solutions are a bit reminiscent of the O'Reilly Hacks sequence. yet this playstation narrative has the flavor of minimum textual content, and no subgroupings of the a hundred questions offered.
* Written via knowledgeable with greater than 30 years of expertise in each function within the IT undefined, this booklet confronts improvement procedure difficulties head-on, and it tackles the severe steps that has to be taken to make sure good fortune* Dives into issues equivalent to opting for possibilities, making plans for achievement, construction a suitable enterprise version, assembling a workforce, constructing software program, coping with groups, and effectively advertising and promoting the product* The booklet fills a void within the present marketplace, and is a perfect learn for all IT execs
A booklet and booklet containing over a hundred and twenty recipes involved in complex management projects to construct and configure strong databases with IBM DB2 booklet and booklet. grasp the entire vital facets of management from cases to IBM's most modern excessive Availability expertise pureScale with this ebook and ebook.
- 10 Minute Guide to Lotus Notes 6
- Google Analytics
- Web Performance: The Definitive Guide
- QlikView Server and Publisher
Extra info for Cody's Data Cleaning Techniques Using SAS Software
PATIENTS PLOT; TITLE "Using PROC UNIVARIATE to Look for Outliers"; VAR HR SBP DBP; RUN; The procedure option PLOT provides you with several graphical displays of the data; a stem-and-leaf plot, a box plot, and a normal probability plot. Output from this procedure is shown next. 0001 Quantiles (Definition 5) Quantile 100% Max 99% 95% 90% 75% Q3 50% Median 25% Q1 10% 5% 1% 0% Min Estimate 900 900 210 208 87 74 60 22 22 10 10 Continued 26 ® Cody’s Data Cleaning Techniques Using SAS Software Using PROC UNIVARIATE to Look for Outliers The UNIVARIATE Procedure Variable: HR (Heart Rate) Extreme Observations ----Lowest---- ----Highest--- Value Obs Value Obs 10 22 22 48 58 23 25 15 24 20 90 101 208 210 900 8 4 19 9 22 Missing Values Missing Value Count .
A’ - ’Z’ INVALUE DBP_CK (UPCASE) 60 - 120, . TXT" PAD; FILE PRINT; ***Send output to the Output window; TITLE "Listing of Invalid Patient Numbers and Data Values"; ***Note: We will only input those variables of interest; INPUT @1 PATNO $3. @15 HR HR_CK3. @18 SBP SBP_CK3. ; IF HR = 8888 THEN PUT PATNO= "Invalid character value for HR"; ELSE IF HR NE 9999 THEN PUT PATNO= HR=; IF SBP = 8888 THEN PUT PATNO= "Invalid character value for SBP"; ELSE IF SBP NE 9999 THEN PUT PATNO= SBP=; IF DBP = 8888 THEN PUT PATNO= "Invalid character value for DBP"; ELSE IF DBP NE 9999 THEN PUT PATNO= DBP=; RUN; The UPCASE option converts any character values to uppercase before it is determined if the value fits into one of the specified ranges.
Next, the program is turned into a macro so that it is easier to use. Program 2-12 uses PROC UNIVARIATE to print out the bottom and top "n" percent of the data values.