To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????[????[^ 0011111100111111001111110011111101011011001111110011111100111111001111110101101101011110 3f3f3f3f5b3f3f3f3f5b5e
SJIS-WIN 達存達俗[達存達俗[^ 10010010010000101001000110110110100100100100001010010001101011010101101110010010010000101001000110110110100100100100001010010001101011010101101101011110 924291b6924291ad5b924291b6924291ad5b5e
EUC-JP 達存達俗[達存達俗[^ 11000011101000111100001010111000110000111010001111000010101011110101101111000011101000111100001010111000110000111010001111000010101011110101101101011110 c3a3c2b8c3a3c2af5bc3a3c2b8c3a3c2af5b5e
UTF-8 達存達俗[達存達俗[^ 111010011000000110010100111001011010110110011000111010011000000110010100111001001011111110010111010110111110100110000001100101001110010110101101100110001110100110000001100101001110010010111111100101110101101101011110 e98194e5ad98e98194e4bf975be98194e5ad98e98194e4bf975b5e
UHC 達存達俗[達存達俗[^ 11010011101110011111000011101101110100111011100111100001110101000101101111010011101110011111000011101101110100111011100111100001110101000101101101011110 d3b9f0edd3b9e1d45bd3b9f0edd3b9e1d45b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)