What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN ?栖償?栖償^ 0011111110010000101100101000111110011110001111111001000010110010100011111001111001011110 3f90b28f9e3f90b28f9e5e
EUC-JP ?栖償?栖償^ 0011111111000000101101001011110111111110001111111100000010110100101111011111111001011110 3fc0b4bdfe3fc0b4bdfe5e
UTF-8 뫎栖償뫎栖償^ 11101011101010111000111011100110101000001001011011100101100001001001111111101011101010111000111011100110101000001001011011100101100001001001111101011110 ebab8ee6a096e5849febab8ee6a096e5849f5e
UHC 뫎栖償뫎栖償^ 10010001101100001101111111110111110111111100000110010001101100001101111111110111110111111100000101011110 91b0dff7dfc191b0dff7dfc15e

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP). A "?" (hex 3f) in the table marks a character that the charset cannot represent.
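
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the function name `encode_table` is hypothetical, and the codec names are assumptions about which Python codecs correspond to the table's charsets (`cp932` for SJIS-win, `euc_jp` for EUC-JP, `cp949` for UHC). Unmappable characters are replaced with "?", which matches the 3f bytes seen above, although edge cases may differ from the tool.

```python
# Charset labels from the table mapped to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (binary string, hex string)} for each charset."""
    result = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        result[label] = (bits, data.hex())
    return result


for label, (bits, hexstr) in encode_table("栖^").items():
    print(label, bits, hexstr)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.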