To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????鋼?┝ 0011111100111111001111110011111100111111001111111000110101111100001111111000010010111010 3f3f3f3f3f3f8d7c3f84ba
EUC-JP ??????鋼?┝ 0011111100111111001111110011111100111111001111111011100111011101001111111010100010111100 3f3f3f3f3f3fb9dd3fa8bc
UTF-8 센솎셈롛뤰탮鋼씔┝ 111011001000010010111100111011001000011010001110111011001000010110001000111010111010000110011011111010111010010010110000111011011000001110101110111010011000101110111100111011001001010010010100111000101001010010011101 ec84bcec868eec8588eba19beba4b0ed83aee98bbcec9494e2949d
UHC 센솎셈롛뤰탮鋼씔┝ 101111001011111010111100110101001011110011000000100011101101111110001111110111101011010110001110110010111011110010111110101111001010011010111100 bcbebcd4bcc08edf8fdeb58ecbbcbebca6bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)