To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????遺???△?源??蹂λ??ъ? 001111110011111100111111001111110011111100111111100010001110001000111111001111110011111110000001101000100011111110001100101110010011111100111111111001101111100010000011110010010011111100111111100001001000110000111111 3f3f3f3f3f3f88e23f3f3f81a23f8cb93f3fe6f883c93f3f848c3f
EUC-JP ???堉??遺??繇△?源??蹂λ??ъ? 00111111001111110011111110001111101101111111110100111111001111111011000011100100001111110011111110001111110101001101000110100010101001000011111110111000101110110011111100111111111011001111101010100110110010110011111100111111101001111110110000111111 3f3f3f8fb7fd3f3fb0e43f3f8fd4d1a2a43fb8bb3f3fecfaa6cb3f3fa7ec3f
UTF-8 閱묐컾堉뷰슭遺쇳닧繇△돦源루튊蹂λ엠嶪ъ쉰 11101001100101101011000111101011101011001001000011101100101110111011111011100101101000001000100111101011101101111011000011101100100010101010110111101001100000011011101011101100100001111011001111101011100010111010011111100111101110011000011111100010100101101011001111101011100011111010011011100110101110101001000011101011101000111010100011101101100010101000101011101000101110011000001011001110101110111110110010010111101000001110010110110110101010101101000110001010111011001000100110110000 e996b1ebac90ecbbbee5a089ebb7b0ec8aade981baec87b3eb8ba7e7b987e296b3eb8fa6e6ba90eba3a8ed8a8ae8b982cebbec97a0e5b6aad18aec89b0
UHC 閱묐컾堉뷰슭遺쇳닧繇△돦源루튊蹂λ엠嶪ъ쉰 111001101111001110010001111010111011000010011111111010111011110010111010111001001011110110111110111010111011011010111100111011011000100010100011111010011010001110100001111000101000100110101010111010101011100110110111111001111011100110011110111010111011001110100101111010111011111110100101111001011111010110101100111011001011110110101110 e6f391ebb09febbcbae4bdbeebb6bced88a3e9a3a1e289aaeab9b7e7b99eebb3a5ebbfa5e5f5acecbdae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)