To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????E 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN ??????????????????鰻?E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110001001010101100011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f89563f45
EUC-JP ??????????????????鰻?E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110110001101101110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fb1b73f45
UTF-8 렻읕렟렺렊렻렡렻旼렧렻읕렟렺렊렻렡렻鰻렟E 11101011101000001011101111101100100111011001010111101011101000001001111111101011101000001011101011101011101000001000101011101011101000001011101111101011101000001010000111101011101000001011101111100110100101111011110011101011101000001010011111101011101000001011101111101100100111011001010111101011101000001001111111101011101000001011101011101011101000001000101011101011101000001011101111101011101000001010000111101011101000001011101111101001101100001011101111101011101000001001111101000101 eba0bbec9d95eba09feba0baeba08aeba0bbeba0a1eba0bbe697bceba0a7eba0bbec9d95eba09feba0baeba08aeba0bbeba0a1eba0bbe9b0bbeba09f45
UHC 렻읕렟렺렊렻렡렻旼렧렻읕렟렺렊렻렡렻鰻렟E 1000111011000011110000001100010010001110101100001000111011000010100011101010000110001110110000111000111010110010100011101100001111011010110001001000111010110110100011101100001111000000110001001000111010110000100011101100001010001110101000011000111011000011100011101011001010001110110000111101100011000100100011101011000001000101 8ec3c0c48eb08ec28ea18ec38eb28ec3dac48eb68ec3c0c48eb08ec28ea18ec38eb28ec3d8c48eb045

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)