Which bit string does a character (or a short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???苡レ????援??鷹??蘂??蟻?? 001111110011111100111111111001001000111110000011100011000011111100111111001111110011111110001001100001110011111100111111100100011110100100111111001111111110010101000001001111110011111110001011011000010011111100111111 3f3f3fe48f838c3f3f3f3f89873f3f91e93f3fe5413f3f8b613f3f
EUC-JP ???苡レ????援??鷹??蘂??蟻?? 001111110011111100111111111001111110111110100101111011000011111100111111001111110011111110110001111001110011111100111111110000101110101100111111001111111110100110100010001111110011111110110101110000100011111100111111 3f3f3fe7efa5ec3f3f3f3fb1e73f3fc2eb3f3fe9a23f3fb5c23f3f
UTF-8 列룸벊苡レ쭦列룸씍援쎿츎鷹낉폇蘂뚢뵚蟻귣씩 111011111010011010011100111010111010001110111000111010111011001010001010111010001000101110100001111000111000001110101100111011001010110110100110111011111010011010011100111010111010001110111000111011001001010010001101111001101000111110110100111011001000111010111111111011001011100010001110111010011011011110111001111010111000001010001001111011011000111110000111111010001001100010000010111010111001101010100010111010111011010110011010111010001001111110111011111010101011011110100011111011001001010010101001 efa69ceba3b8ebb28ae88ba1e383acecada6efa69ceba3b8ec948de68fb4ec8ebfecb88ee9b7b9eb8289ed8f87e89882eb9aa2ebb59ae89fbbeab7a3ec94a9
UHC 列룸벊苡レ쭦列룸씍援쎿츎鷹낉폇蘂뚢뵚蟻귣씩 111001101110101010110111111010111001001110101101111011001011111010101011111011001010011110011010111001101110101010110111111010111001110110100100111010101011010110011011111001101010111010001001111010111110110110000101111011111011110010010100111001111101111010001100111000101001010010011010111010111111110010000010111010111011111010111111 e6eab7eb93adecbeabeca79ae6eab7eb9da4eab59be6ae89ebed85efbc94e7de8ce2949aebfc82ebbebf

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
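
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper name encode_row are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which appears to match the 3f runs in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "あ"  # hypothetical input; any short string works
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string)
```

For instance, encode_row("あ", "utf-8") yields the bit string 111000111000000110000010 and the hex string e38182, while the same character under ISO-8859-1 falls back to 3f because that charset has no hiragana.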