何毓琦的个人博客分享 http://blog.sciencenet.cn/u/何毓琦 哈佛(1961-2001) 清华(2001-date)

博文

On Finding a Solution to a Multi-Faceted and Complex Problem 精选

已有 11261 次阅读 2017-12-15 22:24 |个人分类:S and T|系统分类:海外观察

Fornew readers and those who request to be “好友 good friends” please read my 公告 first


In our daily living we often face the problem of making a decision on a  rather complex and multi-dimensional problem, from the mundane to  the life  changing. For example,from  “where should my family go for vacation this year?”, to “should I propose to my current girlfriend?” or  “what should I do on this life-or-death medical decision?”. More recently, our own town government wants to borrow and spend a considerable of money (about 100million dollars) for various capital investments in the town. The town peoplewere  asked to approve/reject such  expenditure with its attendant tax increase.

Human instinct is often trying to find a good, and if possible the best  (i.e.,optimal),solution to such a decision problem. Unfortunately, optimal solution to such complex decision problems are seldom possible either theoretically, computationally, or practically. My own academic research for the past quarter century have been devoted to make some inroads to the solution of this question of “optimization”. The one sentence summary of my work is this

“We should stop searching for the truly optimal solution and  look for good enough solution instead.”

This is because the effort to finds the optimal often turns out to be not cost-effective even if it were possible. Besides the world in fullof uncertainties and  unknowns, our best laid plan can go awry. Best decision does not always  guarantee best outcomes. Instead, what I suggest we should do  is be willing to settle for the good enough which are easier to find.  This is intuitively reasonable! And I submit human civilization advanced for thousands of years on just such premises.  My research effort and those of  others since my first discovery on a rainy afternoon in 1991 simply made  this notion precise,quantitative, and scientific rather than just intuitive and vague.  To really follow the detail arguments require advanced and specialized  knowledge.  But I want to present a simple demonstration here which I  believe everyone can follow and play with and possibly use in actual daily  living. You need a PC that runs MS EXCEL software.

Imagine you face a complex problem and you have to choose one out of  200 possible solutions/decisions. You can of course, brute force test out each one of these 200 choices and then pick out the best. But let us suppose that to test out each one of the choices can be costly and time consuming say millions of dollars and months of time – i.e., basically impossible. So what do you do?  Well, first we soften our goal of finding the BEST. Instead we say we’ll accept any solution/decision that is in the top 12 of the 200  (the top 6%). Next, we use some quick-and-dirty approximations plus whatever  intuitions we  have to rank order the 200 choices and take the top 12  based on such approximate method. Now the question is

Question 1: What are the chances that in these approximately determined top-12s, there are actually true top-12s (or in other words,  how are the list of estimated top-12 and the list of true top-12 aligned)?

The demonstration attached here is designed to answer this question.  What you need is a PC that runs Microsoft EXCEL. You open the attached spread sheet with this message using MS-EXCEL. Then you  need to flipback and forth between  reading the instruction in this message while also viewing the spread sheet in a different window.  Better yet, print out this message separately while the instruction directs you to do various things with the EXCEL sheet you opened. This way you don’t have to flip back and forth betweentwo windows on your PC. So here goes the instruction for this demo:


If you open the EXCEL sheet you’ll see that it has 5 or more columns (A-E) each  of which has 200 rows. Of course I cannot show all 200 rows all together, there  isn’t space. Thus only the first 14 rows and the last 5 rows are shown for  clarity (the middle rows are squeeze together but they arethere). The first column is simply the numbers 1 thru 200 ordered representing the 200 decisions  one can make. The second column lists the cost(or result or performance) of  each decision for the complex problem – the first decision has the least cost of 1  and second has cost 2 and so on. . . . If you click on the button “reset” you will see the third column (denoted as “order”) which will tell you which row is best (in this case the cost/performance/result in the second column.The least cost happens to be in the first row in this reset case, the second least cost in the second row  and so on. You may need to "enable editing and then enable content" of the spread sheet for the buttons  to work ). Now this is supposed to represent a very complex problem. Thus, we cannot know the true cost (and thus its goodness or rank orders) for each of the decision without prohibitive amount of computation and/or testing (in other words pretend you cannot see columns 2 and 3).

Question 2: How are we going to pick adecision in this situation?

Answer: suppose we have a quick-and-dirty method (could be simply your  intuition) that can easily estimate the cost of each decision. But this method or intuition can be grossly wrong in the estimated cost. We represent this error by the column titled “noise/error”. This is the fourthcolumn of  numbers which are randomly distributed between 1 and 100 (Remember the true cost ranges from 1  to 200. Thus this noise/error can be quite large.The best cost of 1 can be wrongly estimated as large as 101.) When this column is added  to the third column we get the fifth column which is the estimated cost of each  decision by this approximate method (true cost plus the error).  This column represents all the information we have. Now since we are willing  to accept any of the top 12decision as a good enough decision, we are simply going to order the rows based on the highly approximate estimates (i.e.,  according to the numbers in column 5 – the best estimated cost  in the first  row and so on) and say these top 12 rows must containsome true top-12 rows. You do this but clicking on the button  “order”. And you get your answer to question 1above in the third column.

Now the first time you do this you may or may not be surprised to find that in  the top-12 choices (rows) based on the grossly approximate method  there  are several (3 to 5) true top 12 results. You may say this is pure luck.  Thus I invite you to try again. This time by clicking the button “alignment” which is simply a short cut of clicking the “reset-addnoise-order” buttons in succession.  You can do this as many times as you like and you will find that the true rank order and the approximate rank order are in fact “aligned” some 4+ out of 12 chances. Now the answer to question 2 above is clear – you use whatever approximated  means to rank order  the decisions very approximately. Then you pick out the estimated top-12 choices and investigate these 12 in detail instead  of all 200 of them – a saving of some 20 times of the work!   You are guaranteed to find several true “good enough”  decisions for your purpose.

Now you may say, I really have no idea how good my approximate rank order scheme is. It could be as bad as randomly select 12 out of the 200 by blind choice. The EXCEL sheet also provides a means for you to examine this also. The 6th button “infinite noise” adds random number1  thru 10000 to the true cost in each row and thus completely overwhelms the true cost, 1 thru 200. The 7th button “blind choice” simply does what the button “alignment” does but with essentially infinite errors added which will overwhelm the true cost. However, before you click this button a couple of times, venture a guess of the answer to the question 1 above. See how good your intuition really is.


Have fun playing with this spread sheet!


By the way, a historical note on the science of discovery. This is how I invented  the tool of “Ordinal Optimization” one rainy afternoon at home in 1991 using  the above spreadsheet thus described. Currently there are close to 1000 scientific papers and one book on this subject, an established and practical  tool of decision making for complex problems.

OO Demo.xls

Notes added 1/1/2018. It is obvious that the numbers "200" and "12' used in the spread sheet demo can be easily changed to suit the readers' needs and interest. You are welcomed to play with it.



https://wap.sciencenet.cn/blog-1565-1089910.html

上一篇:WELL DESERVED HONOR
下一篇:Another article from my friend, Art Chen
收藏 IP: 74.104.134.*| 热度|

8 刘全慧 许培扬 白龙亮 武夷山 马陶武 张士伟 徐晓 刘炜

该博文允许注册用户评论 请点击登录 评论 (2 个评论)

数据加载中...
扫一扫,分享此博文

Archiver|手机版|科学网 ( 京ICP备07017567号-12 )

GMT+8, 2024-3-29 00:19

Powered by ScienceNet.cn

Copyright © 2007- 中国科学报社

返回顶部