To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????[????[^ 0011111100111111001111110011111101011011001111110011111100111111001111110101101101011110 3f3f3f3f5b3f3f3f3f5b5e
SJIS-WIN 坦卒炭達[坦卒炭達[^ 10010010010100101001000110110010100100100101100110010010010000100101101110010010010100101001000110110010100100100101100110010010010000100101101101011110 925291b2925992425b925291b2925992425b5e
EUC-JP 坦卒炭達[坦卒炭達[^ 11000011101100111100001010110100110000111011101011000011101000110101101111000011101100111100001010110100110000111011101011000011101000110101101101011110 c3b3c2b4c3bac3a35bc3b3c2b4c3bac3a35b5e
UTF-8 坦卒炭達[坦卒炭達[^ 111001011001110110100110111001011000110110010010111001111000001010101101111010011000000110010100010110111110010110011101101001101110010110001101100100101110011110000010101011011110100110000001100101000101101101011110 e59da6e58d92e782ade981945be59da6e58d92e782ade981945b5e
UHC 坦卒炭達[坦卒炭達[^ 11110111101001001111000011101111111101111010100111010011101110010101101111110111101001001111000011101111111101111010100111010011101110010101101101011110 f7a4f0eff7a9d3b95bf7a4f0eff7a9d3b95b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)