To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ????????嬋^ 0011111100111111001111110011111100111111001111110011111100111111100110110110100001011110 3f3f3f3f3f3f3f3f9b685e
EUC-JP ????????嬋^ 0011111100111111001111110011111100111111001111110011111100111111110101011100100101011110 3f3f3f3f3f3f3f3fd5c95e
UTF-8 센셈센솥센셈센셀嬋^ 11101100100001001011110011101100100001011000100011101100100001001011110011101100100001101010010111101100100001001011110011101100100001011000100011101100100001001011110011101100100001011000000011100101101011001000101101011110 ec84bcec8588ec84bcec86a5ec84bcec8588ec84bcec8580e5ac8b5e
UHC 센셈센솥센셈센셀嬋^ 10111100101111101011110011000000101111001011111010111100110111001011110010111110101111001100000010111100101111101011110010111111111000001011110101011110 bcbebcc0bcbebcdcbcbebcc0bcbebcbfe0bd5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)