Which bit string does each character set encode a character (or characters) to?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ?????????^ | 00111111001111110011111100111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f3f3f3f5e |
| SJIS-WIN | ???率??幽??^ | 001111110011111100111111100101111010011000111111001111111001011101001000001111110011111101011110 | 3f3f3f97a63f3f97483f3f5e |
| EUC-JP | ???率??幽??^ | 001111110011111100111111110011101010100000111111001111111100110110101001001111110011111101011110 | 3f3f3fcea83f3fcda93f3f5e |
| UTF-8 | 嶪용뛼率방뿿幽귘끃^ | 11100101101101101010101011101100100110101010100111101011100110111011110011100111100011101000011111101011101100001010100111101011101111111011111111100101101110011011110111101010101101111001100011101011100000011000001101011110 | e5b6aaec9aa9eb9bbce78e87ebb0a9ebbfbfe5b9bdeab798eb81835e |
| UHC | 嶪용뛼率방뿿幽귘끃^ | 11100101111101011011111111101011100011011000001011100001111000111011100111100110100101111011111111101010111010111000001011100010100001011011100101011110 | e5f5bfeb8d82e1e3b9e697bfeaeb82e285b95e |

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
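
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes that Python's `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs correspond to the ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC rows, and that a character the target charset cannot represent is replaced by `?` (byte `3f`), as the rows above suggest. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical helpers introduced for illustration.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent are replaced by "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = raw.decode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Encode `text` in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (shown, bits, hexa) in convert("率^").items():
        print(name, shown, bits, hexa)
```

Passing the full input string from the table to `convert` instead of `"率^"` should yield rows comparable to those above, although replacement behaviour for unrepresentable characters may differ between implementations.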