To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 蛟溷ウエ阡疲昏闃 1110010110000000100111111110010110110011101101001110100010010100100101001110011010001101101010001110100010001010 e5809fe5b3b4e89494e68da8e88a
EUC-JP 蛟溷ウエ阡疲昏闃 11101001111000001101111011100111100011101011001110001110101101001110111111110100110010001110100010111010101010101110111111101010 e9e0dee78eb38eb4eff4c8e8baaaefea
UTF-8 蛟溷ウエ阡疲昏闃 111010001001101110011111111001101011101010110111111011111011110110110011111011111011110110110100111010011001100010100001111001111001011010110010111001101001100010001111111010011001011110000011 e89b9fe6bab7efbdb3efbdb4e998a1e796b2e6988fe99783
UHC 蛟???阡疲昏? 110011101111000100111111001111110011111111110100110001101111100110101010111110111110011100111111 cef13f3f3ff4c6f9aafbe73f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)