Friday, November 11, 2011

Paper Reading #32


Authors: Kate Ehrlich IBM, Cambridge, MA, USA

Susanna E. Kirk IBM, Cambridge, MA, USA

John Patterson IBM, Cambridge, MA, USA

Jamie C. Rasmussen IBM, Cambridge, MA, USA

Steven I. Ross IBM, Cambridge, MA, USA

Daniel M. Gruen IBM, Cambridge, MA, USA


Proceeding 

CHI '11 Proceedings of the 2011 annual conference on Human factors in computing systems 

 


Summary

  • Hypothesis - The researchers point out that the explanations intelligent systems give can both help and hurt users. They hypothesized that explanations help users when a correct answer is available but hurt them when one is not, and their goal was to determine the extent of both effects.
  • Content - Intelligent systems can give suggestions and explanations for problems. In critical situations, users must be able to trust that the system's responses are correct and that its explanations are sound. However, the researchers claim that even when the system is correct, its explanations can lead to incorrect results through ambiguity.
  • Methods - The researchers designed a prototype machine learning program (NIMBLE) that gives recommendations and explanations for network security issues. 19 participants each completed a two-hour session consisting of 24 timed trials of network security tasks. Each task was to classify a set of 11 threats as either low or medium importance. To help the users, the system offered three levels of assistance: no recommendation, a recommendation only, or a recommendation along with a justification. Trials also either contained one correct choice or none, crossed with the three recommendation conditions (a small sketch of this design follows the summary). The helpfulness of the system was then rated on a Likert scale.
  • Results - Suggestions and justifications significantly improved accuracy when a correct choice was available. When no correct choice was available, accuracy dropped, though not significantly. Overall, the researchers found that participants made wide use of the suggestions, to a degree that depended on their skill level.
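To make the study design concrete, here is a minimal sketch of how the six trial conditions (three recommendation levels crossed with whether a correct choice exists) could be enumerated and balanced across the 24 timed trials. The names and the balanced assignment are my own illustration, not code from the NIMBLE prototype.

```python
import itertools
import random

# Three recommendation levels crossed with correct-choice availability,
# as described in the Methods above (names are illustrative).
RECOMMENDATION_LEVELS = ["none", "recommendation_only", "recommendation_with_justification"]
CORRECT_CHOICE_AVAILABLE = [True, False]

def build_trials(n_trials=24, seed=0):
    """Assign each timed trial one of the six conditions, balanced and shuffled."""
    conditions = list(itertools.product(RECOMMENDATION_LEVELS, CORRECT_CHOICE_AVAILABLE))
    trials = conditions * (n_trials // len(conditions))  # 4 repetitions of each condition
    random.Random(seed).shuffle(trials)
    return [{"trial": i + 1, "recommendation": rec, "correct_available": ok}
            for i, (rec, ok) in enumerate(trials)]

if __name__ == "__main__":
    for t in build_trials()[:6]:
        print(t)
```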
Discussion
The closest thing I can relate this system to is the spelling and grammar check in earlier versions of Microsoft Word. Back then, the explanations for some grammar corrections were vague, and some corrections came with no explanation at all. Granted, rationally parsing all incorrect English is not an easy task, but I remember being unsatisfied with some of the results. For future work, these principles should be applied in areas where fault tolerance is necessary.

Paper Reading #31

Identifying emotional states using keystroke dynamics

 

Authors: Clayton Epp University of Saskatchewan, Saskatoon, Saskatchewan, Canada

Michael Lippold University of Saskatchewan, Saskatoon, Saskatchewan, Canada

Regan L. Mandryk University of Saskatchewan, Saskatoon, Saskatchewan, Canada

 

Proceeding 

CHI '11 Proceedings of the 2011 annual conference on Human factors in computing systems 

 


Summary

  • Hypothesis - Traditionally, emotion detection requires sensors that directly measure physical data from a participant. The researchers believe their system can reliably detect a participant's emotions based on the dynamics of how they type.
  • Content - This paper avoids the sensor-based data collection used by other approaches to determining a user's emotional state. Here, the only data collected are the keystroke dynamics themselves and questionnaire feedback used to label the data for machine learning. Previously, data from non-fixed typing could not accurately determine a user's emotional state, so fixed-length typing tasks were given to users for data analysis. According to the researchers, free text was too variable to accomplish their goals accurately.
  • Methods - Instead of gathering keystroke data in a laboratory setting, the data was collected at home on a daily basis. 26 participants were asked to fill out a questionnaire about how they felt emotionally and then type a fixed-length piece of text. The data was collected regularly for 4 weeks. The information collected fell into 4 categories:
  1. Keystroke features - keystroke duration and latency.
  2. Content features - which characters were typed, including capital letters and numbers.
  3. Emotional state classes - data collected from the questionnaires that included 15 questions about their emotional state.
  4. Additional data points - the active process name for each collected keystroke.
Also, since people often paused during the tests, further processing was performed on the keystroke data. If a pause was longer than the standard deviation of the data, that data point was thrown away, which removed 0.07% of all the data (a rough sketch of this feature extraction and filtering follows the summary).
  • Results - Models were created from the data that detected confidence, hesitance, nervousness, relaxation, sadness, and tiredness. Some states, such as excitement and anger, were transient and were not classified properly within the fixed-length text. However, only a few features were necessary to classify emotions accurately.
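As a rough illustration of the keystroke features and the pause filtering described above, here is a minimal Python sketch. The exact threshold rule (dropping latencies more than one standard deviation above the mean) is my reading of the summary, and all function and field names are my own, not the paper's.

```python
from statistics import mean, stdev

# Each event is (key, press_time, release_time) in seconds (illustrative format).
def extract_features(events):
    """Compute per-key durations and between-key latencies from raw key events."""
    durations = [release - press for _, press, release in events]
    latencies = [events[i + 1][1] - events[i][2] for i in range(len(events) - 1)]
    return durations, latencies

def filter_pauses(latencies):
    """Drop latencies that exceed the mean by more than one standard deviation
    (an assumed reading of the pause-removal rule described above)."""
    if len(latencies) < 2:
        return latencies
    m, s = mean(latencies), stdev(latencies)
    return [x for x in latencies if x <= m + s]

if __name__ == "__main__":
    events = [("h", 0.00, 0.08), ("i", 0.15, 0.22), (" ", 1.90, 1.97), ("t", 2.05, 2.12)]
    durations, latencies = extract_features(events)
    print(durations)
    print(filter_pauses(latencies))  # the long pause before the space is dropped
```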
Discussion
The researchers accomplished their goal adequately, but I think their system should have been designed to allow more flexibility, especially over time. Requiring a user to type a fixed-length text makes it hard to deploy their models on any free-form typing. However, since the content of the typed words is not itself analyzed, this may simply have been impossible. Otherwise, this type of machine learning could be applied to operating systems so that they act differently depending on the emotional state users convey through how they type. Another data point they left out of the analysis is mouse movement, which could further inform model generation and possibly make it more accurate.

 

 

 

Paper Reading #30

Life "modes" in social media


Authors: Fatih Kursat Ozenc Carnegie Mellon University, Pittsburgh, Pennsylvania, USA

Shelly D. Farnham Yahoo!, Sunnyvale, California, USA


Proceeding 

CHI '11 Proceedings of the 2011 annual conference on Human factors in computing systems 

 


Summary

  • Hypothesis - The researchers suggest that people interact in different ways based on the social contexts of their lives. These life facets can include work, school, and home environments. They also believe that if online interactions could be oriented around these facets, the interactions would be more effective.
  • Content -

  1. Modes of the self - Different life facets afford different "modes" within which people interact.
  2. Focused sharing - Broadcasting information is generally disliked compared to focused sharing, which allows information to be shared with only a few people instead of everyone.
  3. Mobile Me - A recurring theme in which many users describe their online interactions as centered on mobile interfaces.
  • Methods - 16 participants took part in a two-hour interview designed to help the researchers understand the facets of their lives. There were 3 specific activities:
  1. Life mapping activity - Participants were asked to draw out parts of their lives and explain how they handle transitions between different states online, as well as how they communicate.
  2. Need validation session - Hypothetical situations were presented to the participants, who were asked to identify possible problems and how they would handle them.
  3. Visual metaphors ranking activity - Participants were asked to rank the best ways to visualize their life facets.
  • Results - For the life mapping activity, many of the participants drew a social meme map or a chronological map to visualize their lives. Most of the social meme maps started from the idea of "me" and spread outward to family, work, and social facets, which were then split up further. Chronological maps showed transition states more accurately but lacked the detail that social meme maps could separate out. For the color-coding activity of marking communication practices, the closer the participant was to a person, the more likely the participant was to mark the possible communication channels in multiple ways. Others, however, were much more specific about how they communicated, avoiding crossover. For working environments, it was found that most people segmented their work and social lives. For example, many participants had separate email accounts for work, personal, and junk mail. Crossing the segmentations of their life facets was usually done by physically moving from one location to another, or by the passage of time.
Discussion
Many people simply ignore their life facets when communicating online. Historically this can have disastrous consequences, such as an employee badmouthing their boss on Facebook after the two have become friends. Social networking sites such as Google and Facebook are now implementing functionality that emulates personal life facets in ways more natural for humans, but its use may or may not catch on because of the different environment that online interaction provides. I think the researchers bring up a very good idea that is currently changing how people interact online. However, interacting differently with different people across different life facets is a generalization of the concept of privacy, and as technology spreads, there seems to be less privacy available. I do not think that a single mode of interaction is possible for humans, but it would simplify the process considerably.









Paper Reading #29

Usable gestures for blind people: understanding preference and performance

 

Authors: Shaun K. Kane University of Washington, Seattle, Washington, USA

Jacob O. Wobbrock University of Washington, Seattle, Washington, USA

Richard E. Ladner University of Washington, Seattle, Washington, USA

Proceeding 

CHI '11 Proceedings of the 2011 annual conference on Human factors in computing systems 


Summary

  • Hypothesis - The researchers believe blind and sighted people use touch interfaces differently, and that accounting for these differences in design practice can provide better gestures for each group of users.
  • Content - Four design guidelines were set for observation:
  1. Avoid symbols in print writing - Blind people may not have learned print writing, so it is a poor form of input.
  2. Favor edges, corners, and other landmarks - Without sight, spatial orientation can be replaced by positions relative to landmarks, increasing gesture effectiveness (a small sketch of this idea follows the summary).
  3. Limit time-based gestures - Blind people take a longer time to perform gestures, so this could limit what they can do.
  4. Reproduce traditional layouts when possible - familiar layouts such as QWERTY let blind people immediately know how to use an interface.
  • Methods - 10 sighted and 10 blind people were asked to perform gestures to accomplish tasks described by a moderator. After making two gestures, they were asked how effective they thought their gestures were on a Likert scale. The study was then repeated: participants were asked to perform the gestures they had previously made and to rate them again on a Likert scale.
  • Results - Because of the lack of visual feedback, blind people found their gestures more fitting than sighted people did, based on their Likert responses. The gestures from blind people were also considerably more complex, having more edges and taking longer to complete, and blind people were more likely to use multi-touch than sighted people. Their gestures were also physically larger. Overall, blind people did not find their gestures easier, but variance differed across tasks, meaning some tasks were easier than others.
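As an illustration of the second design guideline (favor edges, corners, and other landmarks), here is a small sketch of how a touch point might be interpreted relative to the nearest screen landmark rather than as an absolute position. The landmark set, screen size, and function names are my own illustration, not from the paper.

```python
import math

# Hypothetical landmark set for a 1080x1920 portrait screen: corners, edge midpoints, center.
WIDTH, HEIGHT = 1080, 1920
LANDMARKS = {
    "top-left": (0, 0), "top-right": (WIDTH, 0),
    "bottom-left": (0, HEIGHT), "bottom-right": (WIDTH, HEIGHT),
    "top-edge": (WIDTH / 2, 0), "bottom-edge": (WIDTH / 2, HEIGHT),
    "left-edge": (0, HEIGHT / 2), "right-edge": (WIDTH, HEIGHT / 2),
    "center": (WIDTH / 2, HEIGHT / 2),
}

def nearest_landmark(x, y):
    """Return the landmark closest to a touch point, so a target can be
    addressed by a relative position rather than an exact coordinate."""
    return min(LANDMARKS, key=lambda name: math.hypot(LANDMARKS[name][0] - x,
                                                      LANDMARKS[name][1] - y))

if __name__ == "__main__":
    print(nearest_landmark(30, 45))      # top-left
    print(nearest_landmark(540, 1000))   # center
```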
Discussion
The researchers achieved their goal of understanding interaction with blind people reasonably well. Unfortunately, today's technologies have been built for so long around visual feedback as an essential backbone that it is difficult to tell whether blind people will ever get the same utility from them. In the future, this issue could be solved by providing artificial sight, but that is far down the road.

Paper Reading #28

Experimental analysis of touch-screen gesture designs in mobile environments


Authors: Andrew Bragdon Brown University, Providence, Rhode Island, USA
Eugene Nelson Brown University, Providence, Rhode Island, USA
Yang Li Google Research, Mountain View, California, USA
Ken Hinckley Microsoft Research, Redmond, Washington, USA


Proceeding 

CHI '11 Proceedings of the 2011 annual conference on Human factors in computing systems


 

Summary

  • Hypothesis - The researchers believe that bezel-initiated gestures are superior to soft buttons, especially in non-ideal environments where focus cannot be held on the interface.
  • Content - Bezel gestures start from the edge of a touch interface (a small sketch of detecting one follows the summary). The other input types tested were soft and hard buttons: a soft button is simply a GUI representation of a button, while a hard button is an actual mechanical button. The second factor tested was whether or not the gestures were aligned to an axis; those aligned to an axis are called mark-based gestures, while the others are termed free-form.
  • Methods - 15 participants were asked to test the effectiveness of the different types of gestures in different environments. Participants were tested while sitting, standing, and walking, with 3 levels of distraction added to these motor states. The least distracting environment allowed constant visual contact with the interface, while the most distracting did not allow the user to look at the screen at all. At the end of the study, participants took a questionnaire to capture other factors.
  • Results - Gestures took more time in more distracting environments, regardless of gesture type. However, in those environments, bezel-initiated gestures were found to be superior to hard and soft buttons. There was no difference between gestures performed while sitting and while standing. The accuracy of free-form gestures was lower than that of mark-based gestures. When distraction was not an issue, soft buttons performed better; the fastest and most preferred combination was sitting with no distractions and soft buttons.
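To make the bezel-gesture idea concrete, here is a minimal sketch of how a touch handler might distinguish a bezel-initiated swipe from an ordinary one by checking whether the touch-down point lies within a thin band along the screen edge, and label it mark-based or free-form by how axis-aligned it is. The thresholds and names are illustrative, not from the paper or any particular toolkit.

```python
# Hypothetical screen size and bezel band width in pixels (illustrative values).
WIDTH, HEIGHT = 1080, 1920
BEZEL_BAND = 20

def starts_on_bezel(x, y):
    """True if a touch-down lands within the edge band, marking a bezel-initiated gesture."""
    return (x <= BEZEL_BAND or x >= WIDTH - BEZEL_BAND or
            y <= BEZEL_BAND or y >= HEIGHT - BEZEL_BAND)

def classify_gesture(down, up):
    """Label a swipe as bezel vs. surface, and mark-based (axis-aligned) vs. free-form."""
    (x0, y0), (x1, y1) = down, up
    dx, dy = abs(x1 - x0), abs(y1 - y0)
    origin = "bezel" if starts_on_bezel(x0, y0) else "surface"
    # Treat a swipe as mark-based when one axis dominates the other (assumed 3:1 ratio).
    shape = "mark-based" if (dx >= 3 * dy or dy >= 3 * dx) else "free-form"
    return origin, shape

if __name__ == "__main__":
    print(classify_gesture((5, 800), (400, 810)))    # ('bezel', 'mark-based')
    print(classify_gesture((300, 600), (520, 900)))  # ('surface', 'free-form')
```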
Discussion
The bezel-initiated gesture built into Android for displaying notifications is one of my favorite things about that system's user interface. I am able to perform the action in distracting environments relatively easily. The only issue for future use is that since this gesture is reserved for the notification menu, a degree of gesture space is lost for future applications. However, in distracting environments such as driving, the best thing to do is to remove the distraction entirely, either by pulling over or by not touching the phone at all.

Paper Reading #27

Sensing cognitive multitasking for a brain-based adaptive user interface

 

Authors: Erin Treacy Solovey Tufts University, Medford, Massachusetts, USA

Francine Lalooses Tufts University, Medford, Massachusetts, USA

Krysta Chauncey Tufts University, Medford, Massachusetts, USA

Douglas Weaver Tufts University, Medford, Massachusetts, USA

Margarita Parasi Tufts University, Medford, Massachusetts, USA

Matthias Scheutz Tufts University, Medford, Massachusetts, USA

Angelo Sassaroli Tufts University, Medford, Massachusetts, USA

Sergio Fantini Tufts University, Medford, Massachusetts, USA

Paul Schermerhorn Indiana University, Bloomington, Indiana, USA

Audrey Girouard Queen's University, Kingston, Ontario, Canada

Robert J.K. Jacob Tufts University, Medford, Massachusetts, USA

 

Proceeding  

CHI '11 Proceedings of the 2011 annual conference on Human factors in computing systems 

 

 

 Summary

  • Hypothesis - The researchers want to show that fNIRS is a reliable way to measure brain states and to test how much stress different types of multitasking place on participants.
  • Content - fNIRS stands for functional near-infrared spectroscopy. It directly measures blood oxygenation in the brain to determine how much stress the user's brain is exhibiting. Multitasking is broken into 3 possible scenarios, each placing a different amount of stress on the brain.
  1. Branching - Changing focus to another task requires keeping a state memory of the tasks. 
  2.  Dual Task - Changing focus to another task does not require keeping a state memory. 
  3.  Delay - The interfering task can be ignored until the first task is done. 
  • Methods - The researchers tested the three types of multitasking with different problem-solving scenarios that involved directing a robot explorer. Delay multitasking was tested by asking the participant whether two successive rock classifications arrived in immediate consecutive order; a message about distance could simply be ignored. Dual task was tested by duplicating the delay circumstances, but with focus on both distance and rock classification, where only an immediate change in the current type of message required attention. Branching was tested by requiring the participant to remember the previous state of each message. 12 participants were selected, and each practiced the system beforehand to get used to the multitasking environment before using the fNIRS equipment.
  • Results - Delay tasks were completed the most accurately and the most quickly; branching tasks were the slowest and least accurate. The stress caused by multitasking was clearly visible in the fNIRS data, which showed the greatest oxygen depletion with branch-type multitasking (a rough sketch of separating the conditions from such data follows the summary).
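As a very rough illustration of how per-trial fNIRS features might separate the three multitasking conditions, here is a toy nearest-centroid classifier over synthetic mean-oxygenation-change values. This is not the researchers' analysis pipeline; the numbers, feature choice, and names are all invented for illustration.

```python
from statistics import mean

# Synthetic mean-oxygenation-change features per training trial, grouped by
# multitasking condition (values are invented for illustration only).
TRAINING = {
    "delay":     [0.10, 0.12, 0.09, 0.11],
    "dual_task": [0.18, 0.21, 0.17, 0.20],
    "branching": [0.30, 0.27, 0.32, 0.29],
}

def fit_centroids(training):
    """Average the feature values for each condition."""
    return {label: mean(values) for label, values in training.items()}

def classify(feature, centroids):
    """Assign a new trial to the condition whose centroid is closest."""
    return min(centroids, key=lambda label: abs(centroids[label] - feature))

if __name__ == "__main__":
    centroids = fit_centroids(TRAINING)
    print(classify(0.28, centroids))  # expected: branching
    print(classify(0.11, centroids))  # expected: delay
```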
Discussion
I find myself multitasking all the time. Using these principles, if I created scenarios where I worked primarily with delay-based multitasking, I could theoretically become 26% more efficient according to the timing data collected. However, given how new computation is to society relative to evolutionary time, I think there is a very good chance that a distinct evolutionary pressure is being put on people who can multitask more efficiently. In the meantime, technology such as fNIRS can be used to minimize the "harmful" types of multitasking that our brains cannot fully handle yet.

Paper Reading #26

Embodiment in brain-computer interaction

 

Authors: Kenton O'Hara Microsoft Research, Cambridge, United Kingdom
Abigail Sellen Microsoft Research, Cambridge, United Kingdom
Richard Harper Microsoft Research, Cambridge, United Kingdom

 

 Proceeding 

CHI '11 Proceedings of the 2011 annual conference on Human factors in computing systems 

 

 

Summary

  • Hypothesis - The researchers believed that people would act differently when operating a brain-computer device.

  • Content - The MindFlex game is a system that consists of a hand-actuated circular obstacle course and an EEG headset that measures brain activity. The more the user concentrates, the higher a fan levitates a styrofoam ball; the user then rotates the game board to direct the ball past the obstacles.
  • Methods - The researchers gave 4 different groups of people a MindFlex game and instructed a leader in each group to gather people to play and to record their reactions while playing. The video footage was later collected and analyzed for patterns.
  • Results - Participants were found to change their physical orientation in an attempt to change their level of concentration. Some held their breath or leaned in closer to the ball; others lowered their concentration by looking away from the game. Another way participants changed their level of concentration was to imagine the ball moving up: the act of imagining increased mental activity and made the ball rise.
Discussion
Considering that the brain carries more information than any other system in the body we could collect data from, I find it depressing that the current level of technology allows only such limited interaction in BCI. In the future, it would be amazing for the technology to advance to the point where interfaces not only tell the difference in activity between localized areas of the brain, but concretely separate out the different ideas being processed in real time. Obviously this won't come in the form of a $70 toy at any point in the near future, but I think it is possible.