To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 疫??奧??疫??弱 1000100101110101001111110011111110011010111110100011111100111111100010010111010100111111001111111000111011100011 89753f3f9afa3f3f89753f3f8ee3
EUC-JP 疫??奧??疫??弱 1011000111010110001111110011111111010100111111000011111100111111101100011101011000111111001111111011110011100101 b1d63f3fd4fc3f3fb1d63f3fbce5
UTF-8 疫욤뮅奧딃뮈疫욤뮅弱 111001111001011010101011111011001001101010100100111010111010111010000101111001011010010110100111111010111001010010000011111010111010111010001000111001111001011010101011111011001001101010100100111010111010111010000101111001011011110010110001 e796abec9aa4ebae85e5a5a7eb9483ebae88e796abec9aa4ebae85e5bcb1
UHC 疫욤뮅奧딃뮈疫욤뮅弱 1110011010111001101111111110100010010010100101001110011111110011100010101110100110111001101111111110011010111001101111111110100010010010100101001110010110110000 e6b9bfe89294e7f38ae9b9bfe6b9bfe89294e5b0

SJIS-win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
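
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the helper name `encode_to_bits` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-win, `cp949` for UHC) are assumptions. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the table.

```python
# Sketch: encode a string in several charsets and show the resulting bytes
# as a binary bit string and as hexadecimal.

# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (bit string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "疫"  # first character of the table's input
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_to_bits(sample, codec)
        print(label, sample, bits, hex_string)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.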