Data Analysis & Simulation

Author Archive


EasyFit Used to Improve the Forecasting of Software Project Status

Monday, November 29th, 2010

The software development community struggles with a way to identify if their projects are on-schedule given the inherent risks of constant invention that inevitably has elements of uncertainty and risk. Current practice is for developers to estimate a software project, and attempt to consider (up-front) all variations to get a viable estimate of time and cost. This process is laborious, and even with due rigor, project slip when the realization that estimates versus actual times fail to match. This leads to costly project overruns and lack of trust in future estimates.

As part of the Agile movement for software development, we think there is a better way and are championing the use of Monte-Carlo simulation as a ways of assessing likely progress and dealing with delays as early as possible… read the full case study

Will Cloud Computing Make Risk Analysis More Economically Efficient?

Monday, November 1st, 2010

What is Cloud Computing?

For some time now, there has been a lot of buzz around cloud computing – the relatively new computing paradigm in which the resources, software, and information are shared on the computer clusters and delivered to the users on demand through the Internet. The idea behind cluster computing is not new: if your applications require a lot of computing resources or impose very strict reliability requirements which cannot be met by a single personal computer or a server, you can link a group of computers into a cluster that will provide a much better performance.

Why Not Build a Cluster Yourself?

Building and maintaining a computer cluster in your organization may have some downsides, such as large upfront investments into technology infrastructure.and high running costs. Of course, there are companies that will do the job of building and managing a computer cluster for you, but anyway, the bottom line is: depending on how loaded your cluster is going to be, it may or may not be economically feasible for your company to run it on-site. For instance, if you need to quickly perform a very CPU-intensive calculation (e.g. render a complex 3D scene), but only once a day, chances are the cluster will not pay off.

And here’s where cloud computing comes into play: you can have access to great computing resources, pay only as you use them, and not worry about the underlying technology infrastructure. These factors combined can provide a great economic benefit, and some major Internet players, including Amazon and Google, are already offering cloud computing platforms for those who want to make their businesses more efficient.

Is Cloud Computing a Good Fit for Risk Analysis?

As one might guess, not just any kind of application can be efficiently run on the cloud. Because at the core of a cloud is a number of computers linked into a cluster, it is very good at processing a large number of independent tasks, such as requests to a web server. That might be the reason why the cloud computing platforms offered by Amazon and Google are mostly used to run websites.

If you consider risk analysis, it looks like an ideal application to be run on the cloud: an input model of several megabytes that can be easily sent to the cloud, a need for huge computational resources to quickly perform Monte Carlo simulation and distribution fitting, sometimes a need for a lot of storage to hold intermediate results, and relatively small-sized analysis results that can be sent back to the users as text and graphics. Add to that the ever-increasing complexity of risk models used across various industries, causing analysts to wait for hours while their simulations are running, and you have a potentially good opportunity to make risk analysis more economically efficient.

To perform further research in this field, we have partnered with Supportex, a technology services company based in Czech Republic, Europe. Supportex has some good experience providing a cloud computing platform for solving problems much more complex than just processing requests to a web server, that is why we have decided to rely on their hardware infrastructure and domain knowledge to run some test applications and see if cloud computing can be of real help in the field of risk analysis.

EasyFit in Academia

Tuesday, August 3rd, 2010

Over the last five years, we have been adding new features to EasyFit mostly with business users in mind, but thanks to the nature of the product and the special academic pricing, it has become quite popular among the academic community: a quick search in Google reveals numerous research papers referring to EasyFit, just to name a few:
 

  • “Co-evolution of Social and Affiliation Networks” (University of Maryland, USA) [link]
  • “Power laws in top wealth distributions: evidence from Canada” (Brock University, Canada) [link]
  • “Duration of Coherence Intervals in Electrical Brain Activity in Perceptual Organization” (RIKEN Brain Science Institute, Japan) [link]
  • “Resource Management Schemes for Mobile Ad hoc Networks” (National University of Singapore, Singapore) [link]
  • “Modelling the diffusion of innovation management theory using S-curves” (University of London, UK) [link]

(see the larger list of papers using EasyFit)

It is pleasing to see EasyFit helping researchers in such diverse disciplines get their job done in a more efficient way.

EasyFitXL Is Now Compatible With Excel 2010

Monday, July 12th, 2010

EasyFitXL – the distribution fitting add-in for Excel – was first introduced with the release of EasyFit 4.0 back in 2007. When designing EasyFitXL, we did a lot of research as to which Excel versions to support. At that time, the latest version of Excel was Excel 2007, which included some new useful features, such as the support for larger worksheets and multi-threaded worksheet recalculation capability. However, many customers were not rushing to upgrade to Excel 2007 because of it’s controversial Ribbon Interface, so we had to make EasyFitXL compatible with the previous version – Excel 2003.

According to some publicly available data, Excel 2002 and Excel 2000 still had a considerable user base, so we have made a decision to support these two older versions as well. As a result, EasyFitXL initially included support for Excel versions from 2000 through 2007, covering perhaps over 99% of all Excel installations in the world.

Last month Microsoft has released Excel 2010 which does not make a big difference in terms of data analysis, however, with its release we started receiving compatibility complaints from our customers, so we performed an in-depth testing and released an updated version of EasyFit (available for download).

Building The International Distributors Network

Monday, June 7th, 2010

Over the past five years, we have been selling our distribution fitting software through Plimus Inc. – the U.S. based company responsible for processing credit card orders and sending out the license keys to the customers who purchased our products. What we like about Plimus is that apart from making the ordering process secure and smooth for our users, this company is also very responsive to any queries, which is especially important when it comes to dealing with people’s money.

However, in some countries there are state regulations preventing customers (mostly government organizations and academic institutions) from ordering software online. To make our distribution fitting products available to users in those countries, we are in the process of creating a network of international distributors. Specifically, this year we have partnered up with a Chinese, a Mexican, and three Taiwanese software resellers who now offer our products through their local distribution channels.

EasyFit 5.3 Released

Wednesday, January 20th, 2010

Recently a customer has contacted us and noted that the Inverse Cumulative Distribution Function (the Quantile Function) of the Inverse Gaussian distribution implemented in EasyFit works well for lambda=1902.1, mu=41857.0 and P=0.9, but fails for the same lambda & mu and P=0.99. Last week we have released an updated version of EasyFit that fixes the problem, and in this post we would like to elaborate more on the issue.

Evaluating the Inverse CDF of the Inverse Gaussian Model
Since the CDF of the Inverse Gaussian distribution is quite complicated (expressed in terms of the two Laplace Integrals), the Inverse CDF of this model is not available in closed form, and cannot be easily evaluated for a given set of distribution parameters. Initially, we have implemented an iterative approximation algorithm that evaluates the ICDF(P) using the CDF as well as the PDF to speed up the calculation. The algorithm itself works very well over a great range of input parameters, however, we have placed a limitation on how many iterations it is allowed to perform.

Because EasyFit is considered an interactive data analysis tool, we are always looking for a balance between the feature set and the performance, which is especially important when using EasyFit with Excel worksheets calculated in real time. The limitation on the number of iterations is necessary to make sure the algorithm doesn’t fall into an “infinite loop”, meaning the situation when it’s unable to reach the specified accuracy regardless of how long it continues to work. The problem usually happens when we are hitting the precision limitations of the computer’s CPU: in theory, the algorithm must converge in a limited number of steps, but in reality, it will just continue iterating over and over again without any accuracy improvements.

As a solution, we have made some improvements to the algorithm, making it more robust and efficient, so it now works with the same accuracy, but for a larger range of input parameters. For example, considering the parameters that initially caused the problem (lambda=1902.1 and mu=41857.0), the ICDF(P) can be evaluated for values of P up to 0.999925, which is more than enough for most statistical analysis applications.

Should You Upgrade?
Since this minor issue does not affect the accuracy of distribution fitting, you only need to upgrade if you are experiencing problems evaluating the Inverse CDF of the Inverse Gaussian distribution for P>0.9, otherwise EasyFit 5.2 will still work well for you.

Simulation & Probabilistic Analysis SDK Released

Thursday, December 3rd, 2009

The beta testing of our new product, the Simulation & Probabilistic Analysis SDK, is now over, and we want to thank our beta testers for their effort and valuable feedback. One of the most exciting things is that during the beta testing phase, we have not detected any bugs in the SDK, indicating the initial high quality of the product.

The production version of the SDK is now available for public download, so if you are a software developer and need to add distribution fitting or simulation features to your software with no hassle, feel free to download the fully functional version of the SDK and try it free for 30 days.

Another good news is that the Christmas is coming, so we decided to make a gift to software developers who are considering to purchase the SDK: until the end of December, the SDK Developer License can be ordered at a $500 discount – click here details.

Simulation & Probabilistic Analysis SDK Available for Public Beta Testing

Monday, October 12th, 2009

The new Software Development Kit enabling you to easily add Monte Carlo simulation and distribution analysis features to your applications is now available for public beta testing – please download the free beta version and take a look at the code examples (available in several languages, including C#, VB.NET, C++, and Visual Basic for Applications).

We would be glad to receive any feedback, questions or suggestions from you, so please feel free to drop us a line and let us know what you think about this product and how we can make it better.

Simulation & Probabilistic Analysis SDK Coming Soon – Beta Testers Welcome

Wednesday, September 9th, 2009

Earlier in December 2008, we announced our plans to create the Software Development Kit that would allow software developers to integrate the simulation and probabilistic analysis functionality of EasyFit into their custom software solutions. Since then, we have been working hard to bring the SDK to market. Despite some delays, we are glad to announce that we have successfully completed the development, and this month we plan to release the beta version of the product.

Become a Beta Tester Today!
Taking part in our beta testing program would not just grant you access to the cutting edge software technology – the most active beta testers will also be able to purchase the SDK at an introductory price before the production version of the SDK is released. To sign up, please contact us with your specific requirements (let us know how you plan to use the SDK and what development platform (C#, VB.NET etc.) you are using.)

New Version of EasyFit Available

Monday, June 1st, 2009

We have just released EasyFit Version 5.1 – the update that fixes a bug causing an incorrect calculation of the chi-squared GOF statistic for small sample sizes. To upgrade, uninstall EasyFit 5.0 from your computer, then download and install the latest version.

EasyFit: select the best fitting distribution and use it to make better decisions. learn more
EasyFit Screenshot - Click To Enlarge
Download Free Trial