To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????}B 0011111100111111001111110011111100111111001111110111110101000010 3f3f3f3f3f3f7d42
SJIS-WIN 他達達短捉遜}B 1001000110111100100100100100001010010010010000101001001001011010100100011010100010010001101110110111110101000010 91bc92429242925a91a891bb7d42
EUC-JP 他達達短捉遜}B 1100001010111110110000111010001111000011101000111100001110111011110000101010101011000010101111010111110101000010 c2bec3a3c3a3c3bbc2aac2bd7d42
UTF-8 他達達短捉遜}B 1110010010111011100101101110100110000001100101001110100110000001100101001110011110011111101011011110011010001101100010011110100110000001100111000111110101000010 e4bb96e98194e98194e79fade68d89e9819c7d42
UHC 他達達短捉遜}B 1111011011100010110100111011100111010011101110011101001110101101111100111011010111100001111000010111110101000010 f6e2d3b9d3b9d3adf3b5e1e17d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)