To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴉?????媛??矣?????愉?Р諛?? 111010011110101100111111001111110011111100111111001111111001010101010001001111110011111111100001111000010011111100111111001111110011111100111111100101101111100100111111100001000101000111100110100001110011111100111111 e9eb3f3f3f3f3f95513f3fe1e13f3f3f3f3f96f93f8451e6873f3f
EUC-JP 鴉?????媛??矣?????愉?Р諛?? 111100101110110100111111001111110011111100111111001111111100100110110010001111110011111111100010111000110011111100111111001111110011111100111111110011001111101100111111101001111011001011101011111001110011111100111111 f2ed3f3f3f3f3fc9b23f3fe2e33f3f3f3f3fccfb3fa7b2ebe73f3f
UTF-8 鴉띾맕留뚨㎣媛㏘틪矣뉗돶蓮곷뿨愉꿩Р諛깅꼹 1110100110110100100010011110101110011101101111101110101110100111100101011110111110100111100011011110101110011010101010001110001110001110101000111110010110101010100110111110001110001111100110001110110110001011101010101110011110011111101000111110101110001001100101111110101110001111101101101110111110100110100110011110101010110011101101111110101110111111101010001110011010000100100010011110101010111111101010011101000010100000111010001010101110011011111010101011100110000101111010101011110010111001 e9b489eb9dbeeba795efa78deb9aa8e38ea3e5aa9be38f98ed8baae79fa3eb8997eb8fb6efa699eab3b7ebbfa8e68489eabfa9d0a0e8ab9beab985eabcb9
UHC 鴉띾맕留뚨㎣媛㏘틪矣뉗돶蓮곷뿨愉꿩Р諛깅꼹 111001001011110010001101111010111001000010100111111010111010011110001100111001111010011110100111111010101011000010100010111001001011101010010100111010111111100010000111111011001000100110111001111001101110010110000001111010111001011110101000111010101111000010110010111001101010110010110010111010111011000010110001111010111000010010010001 e4bc8deb90a7eba78ce7a7a7eab0a2e4ba94ebf887ec89b9e6e581eb97a8eaf0b2e6acb2ebb0b1eb8491

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)