To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 嚥??旬??鸚?嚥??旬??鸚?B 1001101010001011001111110011111110001111011110110011111100111111111010100101111100111111100110101000101100111111001111111000111101111011001111110011111111101010010111110011111101000010 9a8b3f3f8f7b3f3fea5f3f9a8b3f3f8f7b3f3fea5f3f42
EUC-JP 嚥??旬??鸚?嚥??旬??鸚?B 1101001111101011001111110011111110111101110111000011111100111111111100111100000000111111110100111110101100111111001111111011110111011100001111110011111111110011110000000011111101000010 d3eb3f3fbddc3f3ff3c03fd3eb3f3fbddc3f3ff3c03f42
UTF-8 嚥좎뼵旬닷짆鸚쬿嚥좎뼵旬닷짆鸚쬿B 11100101100110101010010111101100101000101000111011101011101111001011010111100110100101111010110011101011100010111011011111101100101001111000011011101001101110001001101011101100101011001011111111100101100110101010010111101100101000101000111011101011101111001011010111100110100101111010110011101011100010111011011111101100101001111000011011101001101110001001101011101100101011001011111101000010 e59aa5eca28eebbcb5e697aceb8bb7eca786e9b89aecacbfe59aa5eca28eebbcb5e697aceb8bb7eca786e9b89aecacbf42
UHC 嚥좎뼵旬닷짆鸚쬿嚥좎뼵旬닷짆鸚쬿B 111001101011111110100000111011001001011010111000111000101110001010110100111001011010001110010101111001011010010010100111011101101110011010111111101000001110110010010110101110001110001011100010101101001110010110100011100101011110010110100100101001110111011001000010 e6bfa0ec96b8e2e2b4e5a395e5a4a776e6bfa0ec96b8e2e2b4e5a395e5a4a77642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)