To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????音??B 0011111100111111001111110011111100111111001111111000100110111001001111110011111101000010 3f3f3f3f3f3f89b93f3f42
EUC-JP ???堉??音??B 00111111001111110011111110001111101101111111110100111111001111111011001010111011001111110011111101000010 3f3f3f8fb7fd3f3fb2bb3f3f42
UTF-8 閱륁뇴堉사븦音섎섶B 11101001100101101011000111101011101001011000000111101011100001111011010011100101101000001000100111101100100000101010110011101011101110001010011011101001100111111011001111101100100001001000111011101100100001001011011001000010 e996b1eba581eb87b4e5a089ec82acebb8a6e99fb3ec848eec84b642
UHC 閱륁뇴堉사븦音섎섶B 11100110111100111000111111101100100001111001100011101011101111001011101111100111100101011000111111101011111001011001100011101011101111001011101101000010 e6f38fec8798ebbcbbe7958febe598ebbcbb42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)