To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????IB 001111110011111100111111001111110011111100111111001111110100100101000010 3f3f3f3f3f3f3f4942
SJIS-WIN ??品?ゲ◇∵IB 00111111001111111001010101101001001111111000001101010001100000011001111010000001111001100100100101000010 3f3f95693f8351819e81e64942
EUC-JP ?đ品?ゲ◇∵IB 001111111000111110101001110000101100100111001010001111111010010110110010101000011111111010100010111010000100100101000010 3f8fa9c2c9ca3fa5b2a1fea2e84942
UTF-8 룶đ品춳ゲ◇∵IB 11101011101000111011011011000100100100011110010110010011100000011110110010110110101100111110001110000010101100101110001010010111100001111110001010001000101101010100100101000010 eba3b6c491e59381ecb6b3e382b2e29787e288b54942
UHC 룶đ品춳ゲ◇∵IB 10001111101010111010100110100010111110011010000110101101100011111010101110110010101000011101111010100001111100010100100101000010 8faba9a2f9a1ad8fabb2a1dea1f14942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)