Which bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????i?????????iB 001111110011111100111111001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f6942
SJIS-WIN ?ョ????僥??i?ョ????僥??iB 00111111100000111000011100111111001111110011111100111111100110010100011000111111001111110110100100111111100000111000011100111111001111110011111100111111100110010100011000111111001111110110100101000010 3f83873f3f3f3f99463f3f693f83873f3f3f3f99463f3f6942
EUC-JP 縯ョ?獒??僥??i縯ョ?獒??僥??iB 100011111101010011001011101001011110011100111111100011111100101110111011001111110011111111010001101001110011111100111111011010011000111111010100110010111010010111100111001111111000111111001011101110110011111100111111110100011010011100111111001111110110100101000010 8fd4cba5e73f8fcbbb3f3fd1a73f3f698fd4cba5e73f8fcbbb3f3fd1a73f3f6942
UTF-8 縯ョㅊ獒섓슭僥쀯쉴i縯ョㅊ獒섓슭僥쀯쉴iB 111001111011100010101111111000111000001110100111111000111000010110001010111001111000110110010010111011001000010010010011111011001000101010101101111001011000001110100101111011001000000010101111111011001000100110110100011010011110011110111000101011111110001110000011101001111110001110000101100010101110011110001101100100101110110010000100100100111110110010001010101011011110010110000011101001011110110010000000101011111110110010001001101101000110100101000010 e7b8afe383a7e3858ae78d92ec8493ec8aade583a5ec80afec89b469e7b8afe383a7e3858ae78d92ec8493ec8aade583a5ec80afec89b46942
UHC 縯ョㅊ獒섓슭僥쀯쉴i縯ョㅊ獒섓슭僥쀯쉴iB 111001101110000010101011111001111010010010111010111010001010001110011000111011111011110110111110111010001110100110010111111011111011110110101111011010011110011011100000101010111110011110100100101110101110100010100011100110001110111110111101101111101110100011101001100101111110111110111101101011110110100101000010 e6e0abe7a4bae8a398efbdbee8e997efbdaf69e6e0abe7a4bae8a398efbdbee8e997efbdaf6942

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
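
A similar table can be produced offline with a short Python sketch. This is not the converter's own code: the helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, `euc_jp` for EUC-JP) is an assumption, so results may differ from the page for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which seems to be what the table shows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


# Print one line per charset: label, binary bit string, hexadecimal bit string.
for name, (bits, hex_string) in encode_table("iB").items():
    print(name, bits, hex_string)
```

For the ASCII input "iB", every charset listed here yields the same two bytes, 69 and 42, which matches the trailing `6942` in each row of the table.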