To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????}????{^ 0011111100111111001111110011111101111101001111110011111100111111001111110111101101011110 3f3f3f3f7d3f3f3f3f7b5e
SJIS-WIN 妖?誼?}妖?誼?{^ 100101110110010000111111100010110110001000111111011111011001011101100100001111111000101101100010001111110111101101011110 97643f8b623f7d97643f8b623f7b5e
EUC-JP 妖?誼?}妖?誼?{^ 110011011100010100111111101101011100001100111111011111011100110111000101001111111011010111000011001111110111101101011110 cdc53fb5c33f7dcdc53fb5c33f7b5e
UTF-8 妖렓誼완}妖렓誼완{^ 111001011010011010010110111010111010000010010011111010001010101010111100111011001001100110000100011111011110010110100110100101101110101110100000100100111110100010101010101111001110110010011001100001000111101101011110 e5a696eba093e8aabcec99847de5a696eba093e8aabcec99847b5e
UHC 妖렓誼완}妖렓誼완{^ 11101000111011011000111010101000111010111111111010111111110011110111110111101000111011011000111010101000111010111111111010111111110011110111101101011110 e8ed8ea8ebfebfcf7de8ed8ea8ebfebfcf7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)