To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 筌?????飮??嚥≪?筌?????飮??嚥≪?B 111000101010001100111111001111110011111100111111001111111001111101011010001111110011111110011010100010111000000111100001001111111110001010100011001111110011111100111111001111110011111110011111010110100011111100111111100110101000101110000001111000010011111101000010 e2a33f3f3f3f3f9f5a3f3f9a8b81e13fe2a33f3f3f3f3f9f5a3f3f9a8b81e13f42
EUC-JP 筌??嫄??飮??嚥≪?筌??嫄??飮??嚥≪?B 11100100101001010011111100111111100011111011101010100001001111110011111111011101101110110011111100111111110100111110101110100010111000110011111111100100101001010011111100111111100011111011101010100001001111110011111111011101101110110011111100111111110100111110101110100010111000110011111101000010 e4a53f3f8fbaa13f3fddbb3f3fd3eba2e33fe4a53f3f8fbaa13f3fddbb3f3fd3eba2e33f42
UTF-8 筌뚯쉧嫄띄땟飮뉗졑嚥≪돘筌뚯쉧嫄띄땟飮뉗졑嚥≪돘B 11100111101011011000110011101011100110101010111111101100100010011010011111100101101010111000010011101011100111011000010011101011100101011001111111101001101000111010111011101011100010011001011111101100101000011001000111100101100110101010010111100010100010011010101011101011100011111001100011100111101011011000110011101011100110101010111111101100100010011010011111100101101010111000010011101011100111011000010011101011100101011001111111101001101000111010111011101011100010011001011111101100101000011001000111100101100110101010010111100010100010011010101011101011100011111001100001000010 e7ad8ceb9aafec89a7e5ab84eb9d84eb959fe9a3aeeb8997eca191e59aa5e289aaeb8f98e7ad8ceb9aafec89a7e5ab84eb9d84eb959fe9a3aeeb8997eca191e59aa5e289aaeb8f9842
UHC 筌뚯쉧嫄띄땟飮뉗졑嚥≪돘筌뚯쉧嫄띄땟飮뉗졑嚥≪돘B 11101111101001111000110011101100100110101000000111101010101100011011011011100111101101101010110111101011111001101000011111101100101000001011111011100110101111111010000111101100100010011010000111101111101001111000110011101100100110101000000111101010101100011011011011100111101101101010110111101011111001101000011111101100101000001011111011100110101111111010000111101100100010011010000101000010 efa78cec9a81eab1b6e7b6adebe687eca0bee6bfa1ec89a1efa78cec9a81eab1b6e7b6adebe687eca0bee6bfa1ec89a142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)