To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 梧??娃??遙??節??娃??敖??撓θ? 1000110011100110001111110011111110001000101000010011111100111111111010101010000100111111001111111001000011011111001111110011111110001000101000010011111100111111100111011100001000111111001111111001110110011010100000111100011000111111 8ce63f3f88a13f3feaa13f3f90df3f3f88a13f3f9dc23f3f9d9a83c63f
EUC-JP 梧??娃??遙??節??娃??敖??撓θ? 1011100011101000001111110011111110110000101000110011111100111111111101001010001100111111001111111100000011100001001111110011111110110000101000110011111100111111110110101100010000111111001111111101100111111010101001101100100000111111 b8e83f3fb0a33f3ff4a33f3fc0e13f3fb0a33f3fdac43f3fd9faa6c83f
UTF-8 梧잞쉽娃됵슭遙룡눟節ㅿ쉽娃됵슭敖쏉쉿撓θ녉 1110011010100010101001111110110010011110100111101110110010001001101111011110010110101000100000111110101110010000101101011110110010001010101011011110100110000001100110011110101110100011101000011110101110001000100111111110011110101111100000001110001110000101101111111110110010001001101111011110010110101000100000111110101110010000101101011110110010001010101011011110011010010101100101101110110010001111100010011110110010001001101111111110011010010010100100111100111010111000111010111000010110001001 e6a2a7ec9e9eec89bde5a883eb90b5ec8aade98199eba3a1eb889fe7af80e385bfec89bde5a883eb90b5ec8aade69596ec8f89ec89bfe69293ceb8eb8589
UHC 梧잞쉽娃됵슭遙룡눟節ㅿ쉽娃됵슭敖쏉쉿撓θ녉 111001111111110010011111111011111011110110110001111010001101111110001001111011111011110110111110111010011010101110110111111001101000011110110111111011111011110110100100111011111011110110110001111010001101111110001001111011111011110110111110111001111111100110011011111011111011110110110010111010001111010110100101111010001000011010111111 e7fc9fefbdb1e8df89efbdbee9abb7e687b7efbda4efbdb1e8df89efbdbee7f99befbdb2e8f5a5e886bf

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
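
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name `encode_all` and the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949` for UHC) are assumptions made for illustration. It also assumes that characters a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_all("あ").items():
        print(label, bits, hexstr)
```

Because each byte is rendered as exactly eight bits, the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.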