To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 猥??弱?????B 111000001100111000111111001111111000111011100011001111110011111100111111001111110011111101000010 e0ce3f3f8ee33f3f3f3f3f42
EUC-JP 猥??弱?????B 111000001101000000111111001111111011110011100101001111110011111100111111001111110011111101000010 e0d03f3fbce53f3f3f3f3f42
UTF-8 猥띾젻弱뉗눖溜잙젻B 11100111100011001010010111101011100111011011111011101100101000001011101111100101101111001011000111101011100010011001011111101011100010001001011011101111101001111000101111101100100111101001100111101100101000001011101101000010 e78ca5eb9dbeeca0bbe5bcb1eb8997eb8896efa78bec9e99eca0bb42
UHC 猥띾젻弱뉗눖溜잙젻B 11101000111001011000110111101011101000001010111011100101101100001000011111101100100001111011000011101010111111101001111111101011101000001010111001000010 e8e58deba0aee5b087ec87b0eafe9feba0ae42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)