To which bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥??議??邑??筌??萸?????筌??誼? 100110101000101100111111001111111000101101100011001111110011111110010111010101110011111100111111111000101010001100111111001111111110010011001110001111110011111100111111001111110011111111100010101000110011111100111111100010110110001000111111 9a8b3f3f8b633f3f97573f3fe2a33f3fe4ce3f3f3f3f3fe2a33f3f8b623f
EUC-JP 嚥??議??邑??筌??萸??洧??筌??誼? 1101001111101011001111110011111110110101110001000011111100111111110011011011100000111111001111111110010010100101001111110011111111101000110100000011111100111111100011111100011110110100001111110011111111100100101001010011111100111111101101011100001100111111 d3eb3f3fb5c43f3fcdb83f3fe4a53f3fe8d03f3f8fc7b43f3fe4a53f3fb5c33f
UTF-8 嚥싲갭議낉쭓邑뀁죪筌뗭떓萸먲쭓洧뺤젩筌뗫쓹誼좦 111001011001101010100101111011001000101110110010111010101011000010101101111010001010110110110000111010111000001010001001111011001010110110010011111010011000001010010001111010111000000010000001111011001010001110101010111001111010110110001100111010111001011110101101111010111001011010010011111010001001000010111000111010111010100010110010111011001010110110010011111001101011010010100111111010111011101010100100111011001010000010101001111001111010110110001100111010111001011110101011111011001001001110111001111010001010101010111100111011001010001010100110 e59aa5ec8bb2eab0ade8adb0eb8289ecad93e98291eb8081eca3aae7ad8ceb97adeb9693e890b8eba8b2ecad93e6b4a7ebbaa4eca0a9e7ad8ceb97abec93b9e8aabceca2a6
UHC 嚥싲갭議낉쭓邑뀁죪筌뗭떓萸먲쭓洧뺤젩筌뗫쓹誼좦 11100110101111111001101011101011101100001011100011101100101000011000010111101111101001111000101111101011111010011011001011101100101000011000010111101111101001111000101111101100100010111010100111101011101011011001000011101111101001111000101111101010111110111001010111101100101000001010000111101111101001111000101111101011100111011001010111101011111111101010000101000010 e6bf9aebb0b8eca185efa78bebe9b2eca185efa78bec8ba9ebad90efa78beafb95eca0a1efa78beb9d95ebfea142

SJIS-Win, EUC-JP: legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
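
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the names CODECS, encode_row and encode_table are hypothetical, and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the 3f bytes visible in the table above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

Each output row pairs the binary and hexadecimal forms of the same bytes, so either column can be derived from the other. Codec behavior for edge cases may differ from the tool used on this page.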