To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 繒絲?郵?畯?憶烽納汀?郵?畯?憶六 111110111000111111100011010011100011111110010111010110000011111111111011011011110011111110001001101011111110000010000010100101000101101110010010111100110011111110010111010110000011111111111011011011110011111110001001101011111001100001011010 fb8fe34e3f97583ffb6f3f89afe082945b92f33f97583ffb6f3f89af985a
EUC-JP 繒絲?郵?畯?憶烽納汀?郵?畯?憶六 100011111101010011010100111001011010111100111111110011011011100100111111100011111100110110111011001111111011001010110001110111111110001011000111101111001100010011110101001111111100110110111001001111111000111111001101101110110011111110110010101100011100111110111011 8fd4d4e5af3fcdb93f8fcdbb3fb2b1dfe2c7bcc4f53fcdb93f8fcdbb3fb2b1cfbb
UTF-8 繒絲ㅆ郵렜畯렧憶烽納汀렣郵렜畯렧憶六 111001111011100110010010111001111011010110110010111000111000010110000110111010011000001110110101111010111010000010011100111001111001010110101111111010111010000010100111111001101000011010110110111001111000001110111101111001111011010010001101111001101011000110000000111010111010000010100011111010011000001110110101111010111010000010011100111001111001010110101111111010111010000010100111111001101000011010110110111001011000010110101101 e7b992e7b5b2e38586e983b5eba09ce795afeba0a7e686b6e783bde7b48de6b180eba0a3e983b5eba09ce795afeba0a7e686b6e585ad
UHC 繒絲ㅆ郵렜畯렧憶烽納汀렣郵렜畯렧憶六 111100011111100111011110111010101010010010110110111010011110100010001110101011101111000111100001100011101011011011100101111000111101110011101011110100101010000111101111111000101000111010110100111010011110100010001110101011101111000111100001100011101011011011100101111000111101011110111111 f1f9deeaa4b6e9e88eaef1e18eb6e5e3dcebd2a1efe28eb4e9e88eaef1e18eb6e5e3d7bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)