What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ?消ю?警漿?班劇 001111111000111111000001100001001001000000111111100011000111100010011111111101110011111110010100110001111000110010000000 3f8fc184903f8c789ff73f94c78c80
EUC-JP ?消ю?警漿?班劇 001111111011111011000011101001111111000000111111101101111101100111011110111110010011111111001000110010011011011111100000 3fbec3a7f03fb7d9def93fc8c9b7e0
UTF-8 뤵消ю퓥警漿콒班劇 1110101110100100101101011110011010110110100010001101000110001110111011011001001110100101111010001010110110100110111001101011110010111111111011001011110110010010111001111000111110101101111001011000101010000111 eba4b5e6b688d18eed93a5e8ada6e6bcbfecbd92e78fade58a87
UHC 뤵消ю퓥警漿콒班劇 100011111110001111100001101111001010110011110000101111111000111011001100111011011110110111101100101100011000111011011010111011001101000010111100 8fe3e1bcacf0bf8eccededecb18edaecd0bc

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent are shown as "?" (byte 3f).
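
The table above can be approximated with a few lines of Python. This is only a sketch, not the converter's actual implementation: the function names `encode_row` and `convert` are hypothetical, and the mapping from the table's labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Unrepresentable characters are replaced with "?", as in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    shown = data.decode(codec, errors="replace")
    # Each byte becomes eight binary digits, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def convert(text):
    """Build one result row per charset, keyed by the table's label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexa) in convert("警").items():
        print(label, shown, bits, hexa)
```

For the single character 警, the UTF-8 row comes out as e8ada6, matching the corresponding bytes in the table, while the ISO-8859-1 row shows "?" (3f) because that charset has no such character.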