To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????­? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111010110100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fad3f
SJIS-WIN ?????ぜ喩??阿??愉??柔ル?鈺??? 0011111100111111001111110011111100111111100000101011101010011010011001110011111100111111100010001010001000111111001111111001011011111001001111110011111110001111010111111000001110001011001111111111101111000100001111110011111100111111 3f3f3f3f3f82ba9a673f3f88a23f3f96f93f3f8f5f838b3ffbc43f3f3f
EUC-JP ???沅?ぜ喩??阿??愉??柔ル?鈺??? 0011111100111111001111111000111111000110111010010011111110100100101111001101001111001000001111110011111110110000101001000011111100111111110011001111101100111111001111111011110111000000101001011110101100111111100011111110001111010101001111110011111100111111 3f3f3f8fc6e93fa4bcd3c83f3fb0a43f3fccfb3f3fbdc0a5eb3f8fe3d53f3f3f
UTF-8 嶺뚮뿭沅좄ぜ喩쏆춷阿쇺돦愉양춯柔ル늅鈺곕­劉 1110111110100110101010111110101110011010101011101110101110111111101011011110011010110010100001011110110010100010100001001110001110000001100111001110010110010110101010011110110010001111100001101110110010110110101101111110100110011000101111111110110010000111101110101110101110001111101001101110011010000100100010011110110010010110100100011110110010110110101011111110011010011111100101001110001110000011101010111110101110001010100001011110100110001000101110101110101010110011100101011100001010101101111011111010011110000111 efa6abeb9aaeebbfade6b285eca284e3819ce596a9ec8f86ecb6b7e998bfec87baeb8fa6e68489ec9691ecb6afe69f94e383abeb8a85e988baeab395c2adefa787
UHC 嶺뚮뿭沅좄ぜ喩쏆춷阿쇺돦愉양춯柔ル늅鈺곕­劉 1110011110101101100011001110101110010111101011011110101010110110101000001110100010101010101111001110101011100111100110111110110010101101100100111110010010111001100110011110001010001001101010101110101011110000101111101110011110101101100011001110101011110101101010111110101110110100101111101110100010101101101100001110101110100001101010011110101011100101 e7ad8ceb97adeab6a0e8aabceae79becad93e4b999e289aaeaf0bee7ad8ceaf5abebb4bee8adb0eba1a9eae5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)