To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 嚥〓?誼??膺??筌??嚥〓?誼??膺??筌??B 1001101010001011100000011010110000111111100010110110001000111111001111111110010001011110001111110011111111100010101000110011111100111111100110101000101110000001101011000011111110001011011000100011111100111111111001000101111000111111001111111110001010100011001111110011111101000010 9a8b81ac3f8b623f3fe45e3f3fe2a33f3f9a8b81ac3f8b623f3fe45e3f3fe2a33f3f42
EUC-JP 嚥〓Ŧ誼??膺??筌??嚥〓Ŧ誼??膺??筌??B 110100111110101110100010101011101000111110101001101011111011010111000011001111110011111111100111101111110011111100111111111001001010010100111111001111111101001111101011101000101010111010001111101010011010111110110101110000110011111100111111111001111011111100111111001111111110010010100101001111110011111101000010 d3eba2ae8fa9afb5c33f3fe7bf3f3fe4a53f3fd3eba2ae8fa9afb5c33f3fe7bf3f3fe4a53f3f42
UTF-8 嚥〓Ŧ誼욥젏膺덉젟筌뗭퐚嚥〓Ŧ誼욥젏膺덉젟筌뗭퐚B 1110010110011010101001011110001110000000100100111100010110100110111010001010101010111100111011001001101010100101111011001010000010001111111010001000011010111010111010111000110110001001111011001010000010011111111001111010110110001100111010111001011110101101111011011001000010011010111001011001101010100101111000111000000010010011110001011010011011101000101010101011110011101100100110101010010111101100101000001000111111101000100001101011101011101011100011011000100111101100101000001001111111100111101011011000110011101011100101111010110111101101100100001001101001000010 e59aa5e38093c5a6e8aabcec9aa5eca08fe886baeb8d89eca09fe7ad8ceb97aded909ae59aa5e38093c5a6e8aabcec9aa5eca08fe886baeb8d89eca09fe7ad8ceb97aded909a42
UHC 嚥〓Ŧ誼욥젏膺덉젟筌뗭퐚嚥〓Ŧ誼욥젏膺덉젟筌뗭퐚B 11100110101111111010000111101011101010001010111011101011111111101011111111101001101000001001000011101011111011001000100011101100101000001001100111101111101001111000101111101100101111011000010011100110101111111010000111101011101010001010111011101011111111101011111111101001101000001001000011101011111011001000100011101100101000001001100111101111101001111000101111101100101111011000010001000010 e6bfa1eba8aeebfebfe9a090ebec88eca099efa78becbd84e6bfa1eba8aeebfebfe9a090ebec88eca099efa78becbd8442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)