Difference between revisions of "Randomization in Excel"

Jump to: navigation, search
(2 intermediate revisions by one other user not shown)
Line 1: Line 1:
This article talks about some of the advantages and disadvantages of randomizing using Excel and also gives a step by step guide on randomizing using Excel.  
<onlyinclude>This article talks about some of the advantages and disadvantages of randomizing using Excel and also gives a step by step guide on randomizing using Excel. </onlyinclude>


== Read First ==
== Read First ==
Line 31: Line 31:
*Assign random numbers to each observation. While doing this, use "paste values" to stop recalculating the randomization. </br> <code> =rand() </code>
*Assign random numbers to each observation. While doing this, use "paste values" to stop recalculating the randomization. </br> <code> =rand() </code>
*Sort the random numbers from the lowest to the highest.
*Sort the random numbers from the lowest to the highest.
*Created an ordered serial number.
*Created an ordered serial number. If you need to balance the data, then first sort by the strata, then by the random values.  
*Assign groups using either the <code> '''mod''' </code> or the  <code> '''if''' </code> formulas.  
*Assign groups using either the <code> '''mod''' </code> or the  <code> '''if''' </code> formulas.  
<!-- Add stratification and balanced part here-->
<!-- Add stratification and balanced part here-->
Line 38: Line 38:
== Back to Parent ==
== Back to Parent ==
This article is part of the topic [[Randomized Control Trials]]
This article is part of the topic [[Randomized Control Trials]]
[[Category: Impact Evaluation Design ]]

Revision as of 11:19, 5 April 2018

This article talks about some of the advantages and disadvantages of randomizing using Excel and also gives a step by step guide on randomizing using Excel.

Read First

If randomization using Stata is feasible, then it should always be the preferred option as randomization in Stata is more easily reproducible.

Reasons why Stata is always preferable to Excel

Here are a some of the reasons :

  • Randomization done in Stata can be better documented through do files. Every step can be documented which makes it easier to reproduce the results.
  • Stata gives us the option of setting which version of Stata we use for randomization. This is useful when different researchers use different versions of Stata.
  • Better documentation and version control gives makes randomization results in Stata consistent across various runs.

Advantages and Disadvantages of Randomizing using Excel

However, Stata might not be available in some cases. For those cases, the advantages and disadvantages of randomizing using Excel are as follows:

Advantages

Here are some of the advantages of randomizing using Excel :

  • Balance/stratification can be achieved during a randomization using Excel.
  • Randomization using Excel is simple to implement and produces a record.
  • Since Excel is widely used, it is commonly understood and used by project staff.

Disadvantages

Some of the disadvantages of using Excel to randomize are as follows:

  • Excel is more mysterious to beneficiaries than public randomization(For example - drawing names from a hat, etc)
  • Randomization in Excel is less replicable than Randomization in Stata.
  • Since the randomization involves copying and pasting, it can be subject to human errors.
  • It is also less flexible to changes in randomization plan.

Steps to Randomization in Excel

Here are the steps of doing a successful randomization using Excel:

  • The first thing that needs to be done is to define a randomization rule. For example = the lowest 50% will be treatment, the rest will be assigned to control, etc.
  • Assign random numbers to each observation. While doing this, use "paste values" to stop recalculating the randomization.
    =rand()
  • Sort the random numbers from the lowest to the highest.
  • Created an ordered serial number. If you need to balance the data, then first sort by the strata, then by the random values.
  • Assign groups using either the mod or the if formulas.
  • Save the record.

Back to Parent

This article is part of the topic Randomized Control Trials