To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???二??醫??炎 00111111001111110011111110010011111100010011111100111111111001111100111000111111001111111000100110001010 3f3f3f93f13f3fe7ce3f3f898a
EUC-JP ???二??醫??炎 00111111001111110011111111000110111100110011111100111111111011101101000000111111001111111011000111101010 3f3f3fc6f33f3feed03f3fb1ea
UTF-8 裂╉굢二뷴쉐醫꾩뿊炎 111011111010011010100000111000101001010110001001111010101011010110100010111001001011101010001100111010111011011110110100111011001000100110010000111010011000011010101011111010101011111010101001111010111011111110001010111001111000001010001110 efa6a0e29589eab5a2e4ba8cebb7b4ec8990e986abeabea9ebbf8ae7828e
UHC 裂╉굢二뷴쉐醫꾩뿊炎 1110011011110001101001101110001110000010100010011110110010100011101110101110010110111101101001101110110010100010100001001110110010010111100100011110011011111010 e6f1a6e38289eca3bae5bda6eca284ec9791e6fa

SJIS-WIN, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP) respectively.
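
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed to be the closest Python equivalents of the charsets listed, and the helper names `encode_row` and `encode_table` are hypothetical. A character that a charset cannot represent is replaced with `?` (byte 0x3f), which is presumably what the runs of `3f` in the table reflect.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("二").items():
        print(name, bits, hex_string)
```

For the single character 二, this gives `93f1` in SJIS-WIN, `c6f3` in EUC-JP and `e4ba8c` in UTF-8, matching the bytes for that character in the table above, and `3f` in ISO-8859-1, which has no code for it.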