To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??ゴ?悠ゲ◇?鷹イ 00111111001111111000001101010011001111111001011101001001100000110101000110000001100111100011111110010001111010011000001101000011 3f3f83533f97498351819e3f91e98343
EUC-JP ??ゴ?悠ゲ◇?鷹イ 00111111001111111010010110110100001111111100110110101010101001011011001010100001111111100011111111000010111010111010010110100100 3f3fa5b43fcdaaa5b2a1fe3fc2eba5a4
UTF-8 룶쾹ゴ룫悠ゲ◇룫鷹イ 111010111010001110110110111011001011111010111001111000111000001010110100111010111010001110101011111001101000001010100000111000111000001010110010111000101001011110000111111010111010001110101011111010011011011110111001111000111000001010100100 eba3b6ecbeb9e382b4eba3abe682a0e382b2e29787eba3abe9b7b9e382a4
UHC 룶쾹ゴ룫悠ゲ◇룫鷹イ 1000111110101011101100101000111110101011101101001000111110100010111010101110110110101011101100101010000111011110100011111010001011101011111011011010101110100100 8fabb28fabb48fa2eaedabb2a1de8fa2ebedaba4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)