To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????}????{^ 0011111100111111001111110011111101111101001111110011111100111111001111110111101101011110 3f3f3f3f7d3f3f3f3f7b5e
SJIS-WIN 霆企ョィ}霆企ョィ{^ 111010001011101110001010111010011010111010101000011111011110100010111011100010101110100110101110101010000111101101011110 e8bb8ae9aea87de8bb8ae9aea87b5e
EUC-JP 霆企ョィ}霆企ョィ{^ 11110000101111011011010011101011100011101010111010001110101010000111110111110000101111011011010011101011100011101010111010001110101010000111101101011110 f0bdb4eb8eae8ea87df0bdb4eb8eae8ea87b5e
UTF-8 霆企ョィ}霆企ョィ{^ 111010011001110010000110111001001011110010000001111011111011110110101110111011111011110110101000011111011110100110011100100001101110010010111100100000011110111110111101101011101110111110111101101010000111101101011110 e99c86e4bc81efbdaeefbda87de99c86e4bc81efbdaeefbda87b5e
UHC 霆企??}霆企??{^ 111011111111110111010000111010100011111100111111011111011110111111111101110100001110101000111111001111110111101101011110 effdd0ea3f3f7deffdd0ea3f3f7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)