Into what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????N??????\??????H 001111110011111100111111001111110011111100111111010011100011111100111111001111110011111100111111001111110101110000111111001111110011111100111111001111110011111101001000 3f3f3f3f3f3f4e3f3f3f3f3f3f5c3f3f3f3f3f3f48
SJIS-WIN ???史??N???史??\???史??H 001111110011111100111111100011100110101000111111001111110100111000111111001111110011111110001110011010100011111100111111010111000011111100111111001111111000111001101010001111110011111101001000 3f3f3f8e6a3f3f4e3f3f3f8e6a3f3f5c3f3f3f8e6a3f3f48
EUC-JP ???史馹?N???史馹?\???史馹?H 001111110011111100111111101110111100101110001111111010011010000100111111010011100011111100111111001111111011101111001011100011111110100110100001001111110101110000111111001111110011111110111011110010111000111111101001101000010011111101001000 3f3f3fbbcb8fe9a13f4e3f3f3fbbcb8fe9a13f5c3f3f3fbbcb8fe9a13f48
UTF-8 젬석장史馹읍N젬석장史馹읍\젬석장史馹읍H 111011001010000010101100111011001000010010011101111011001001111010100101111001011000111110110010111010011010011010111001111011001001110110001101010011101110110010100000101011001110110010000100100111011110110010011110101001011110010110001111101100101110100110100110101110011110110010011101100011010101110011101100101000001010110011101100100001001001110111101100100111101010010111100101100011111011001011101001101001101011100111101100100111011000110101001000 eca0acec849dec9ea5e58fb2e9a6b9ec9d8d4eeca0acec849dec9ea5e58fb2e9a6b9ec9d8d5ceca0acec849dec9ea5e58fb2e9a6b9ec9d8d48
UHC 젬석장史馹읍N젬석장史馹읍\젬석장史馹읍H 110000011010101010111100101011101100000011100101110111101100100011101100111100011100000010111110010011101100000110101010101111001010111011000000111001011101111011001000111011001111000111000000101111100101110011000001101010101011110010101110110000001110010111011110110010001110110011110001110000001011111001001000 c1aabcaec0e5dec8ecf1c0be4ec1aabcaec0e5dec8ecf1c0be5cc1aabcaec0e5dec8ecf1c0be48

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
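
The conversion in the table can be approximated with a short Python sketch. It is not this page's own implementation. The codec names are assumptions about which Python codecs correspond to the charsets listed (in particular, `cp932` for SJIS-WIN and `cp949` for UHC), and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table shows for ISO-8859-1.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as noted above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (bits, hex_digits) in encode_table("史N").items():
        print(name, bits, hex_digits)
```

For a single character such as 史, the UTF-8 entry is the three bytes e5 8f b2, which matches the corresponding run of hex digits in the UTF-8 row of the table.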