To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 俉??節??譽??仰??也??昻??要??E 1111101001100001001111110011111110010000110111110011111100111111111001101010001100111111001111111000101111000010001111110011111110010110111001110011111100111111111110101101000000111111001111111001011101110110001111110011111101000101 fa613f3f90df3f3fe6a33f3f8bc23f3f96e73f3ffad03f3f97763f3f45
EUC-JP 俉??節??譽??仰??也?????要??E 1000111110110001101110110011111100111111110000001110000100111111001111111110110010100101001111110011111110110110110001000011111100111111110011001110100100111111001111110011111100111111001111111100110111010111001111110011111101000101 8fb1bb3f3fc0e13f3feca53f3fb6c43f3fcce93f3f3f3f3fcdd73f3f45
UTF-8 俉녑쪍節얍떉譽낂찣仰묋뜵也뉛슭昻잒꼨要잍겣E 11100100101111111000100111101011100001011001000111101100101010101000110111100111101011111000000011101100100101101000110111101011100101101000100111101000101011011011110111101011100000101000001011101100101100001010001111100100101110111011000011101011101011001000101111101011100111001011010111100100101110011001111111101011100010011001101111101100100010101010110111100110100110001011101111101100100111101001001011101010101111001010100011101000101001101000000111101100100111101000110111101010101100101010001101000101 e4bf89eb8591ecaa8de7af80ec968deb9689e8adbdeb8282ecb0a3e4bbb0ebac8beb9cb5e4b99feb899bec8aade698bbec9e92eabca8e8a681ec9e8deab2a345
UHC 俉녑쪍節얍떉譽낂찣仰묋뜵也뉛슭昻잒꼨要잍겣E 11100111111010111011001111100101101001011000011111101111101111011011111011100101100010111001111111100111111000101000010111101001101010011001111111100100111001101001000111101000100011011011001111100101101001011000011111101111101111011011111011100100111010011001111111101000100001001000010111101001101010011001111111100110100000011011010101000101 e7ebb3e5a587efbdbee58b9fe7e285e9a99fe4e691e88db3e5a587efbdbee4e99fe88485e9a99fe681b545

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)