To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蜈??厓??厓??語?????蜈?К節?? 11100101100001010011111100111111111110101000110100111111001111111111101010001101001111110011111110001100111010100011111100111111001111110011111100111111111001011000010100111111100001000100101110010000110111110011111100111111 e5853f3ffa8d3f3ffa8d3f3f8cea3f3f3f3f3fe5853f844b90df3f3f
EUC-JP 蜈??厓??厓??語??旿??蜈?К節?? 1110100111100101001111110011111110001111101101001100011100111111001111111000111110110100110001110011111100111111101110001110110000111111001111111000111111000001111101000011111100111111111010011110010100111111101001111010110011000000111000010011111100111111 e9e53f3f8fb4c73f3f8fb4c73f3fb8ec3f3f8fc1f43f3fe9e53fa7acc0e13f3f
UTF-8 蜈졾ㄷ厓됭쪧厓곤슴語쒐쐴旿딉슁蜈좂К節몌쉬 1110100010011100100010001110110010100001101111101110001110000100101101111110010110001110100100111110101110010000101011011110110010101010101001111110010110001110100100111110101010110011101001001110110010001010101101001110100010101010100111101110110010010010100100001110110010010000101101001110011010010111101111111110101110010100100010011110110010001010100000011110100010011100100010001110110010100010100000101101000010011010111001111010111110000000111010111010101010001100111011001000100110101100 e89c88eca1bee384b7e58e93eb90adecaaa7e58e93eab3a4ec8ab4e8aa9eec9290ec90b4e697bfeb9489ec8a81e89c88eca282d09ae7af80ebaa8cec89ac
UHC 蜈졾ㄷ厓됭쪧厓곤슴語쒐쐴旿딉슁蜈좂К節몌쉬 111010001010010110100000111001011010010010100111111001001110110110001001111010001010010110100000111001001110110110110000111011111011110110111111111001011101111010011100111001111011111010100001111001111111101010001010111011111011110110110011111010001010010110100000111001111010110010101100111011111011110110111000111011111011110110101100 e8a5a0e5a4a7e4ed89e8a5a0e4edb0efbdbfe5de9ce7bea1e7fa8aefbdb3e8a5a0e7acacefbdb8efbdac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)