What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert".


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????}????{^ 0011111100111111001111110011111101111101001111110011111100111111001111110111101101011110 3f3f3f3f7d3f3f3f3f7b5e
SJIS-WIN 鴉?員?}鴉?員?{^ 111010011110101100111111100010001111010100111111011111011110100111101011001111111000100011110101001111110111101101011110 e9eb3f88f53f7de9eb3f88f53f7b5e
EUC-JP 鴉?員?}鴉?員?{^ 111100101110110100111111101100001111011100111111011111011111001011101101001111111011000011110111001111110111101101011110 f2ed3fb0f73f7df2ed3fb0f73f7b5e
UTF-8 鴉렍員렠}鴉렍員렠{^ 111010011011010010001001111010111010000010001101111001011001001110100001111010111010000010100000011111011110100110110100100010011110101110100000100011011110010110010011101000011110101110100000101000000111101101011110 e9b489eba08de593a1eba0a07de9b489eba08de593a1eba0a07b5e
UHC 鴉렍員렠}鴉렍員렠{^ 11100100101111001000111010100011111010101010110010001110101100010111110111100100101111001000111010100011111010101010110010001110101100010111101101011110 e4bc8ea3eaac8eb17de4bc8ea3eaac8eb17b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f).
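
The conversion shown in the table can be approximated with a minimal Python sketch. It is not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the names `CODECS`, `to_bits`, and `encode_table` are hypothetical. Characters a codec cannot represent are replaced with "?", which matches the 3f bytes in the rows above.

```python
# Assumed mapping from the table's labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Render each byte as 8 binary digits and concatenate them."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_table(text: str) -> dict:
    """Return {label: (binary string, hex string)} for every codec above."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        rows[label] = (to_bits(data), data.hex())
    return rows


if __name__ == "__main__":
    # The sample string from the table, written with escapes so the
    # source file's own encoding does not matter.
    sample = "\u9d09\ub80d\u54e1\ub820}\u9d09\ub80d\u54e1\ub820{^"
    for label, (bits, hex_string) in encode_table(sample).items():
        print(label, bits, hex_string)
```

Each byte always contributes exactly eight binary digits and two hexadecimal digits, so the binary column is four times as long as the hexadecimal column in every row.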