To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 汁?????汁???雋孟???遠????瓦? 10001111011000000011111100111111001111110011111100111111100011110110000000111111001111110011111111101000101100101001011011010000001111110011111100111111100010011001001100111111001111110011111100111111100010101010001000111111 8f603f3f3f3f3f8f603f3f3fe8b296d03f3f3f89933f3f3f3f8aa23f
EUC-JP 汁???焌?汁???雋孟???遠????瓦? 101111011100000100111111001111110011111110001111110010011110100000111111101111011100000100111111001111110011111111110000101101001100110011010010001111110011111100111111101100011111001100111111001111110011111100111111101101001010010000111111 bdc13f3f3f8fc9e83fbdc13f3f3ff0b4ccd23f3f3fb1f33f3f3f3fb4a43f
UTF-8 汁흗렓렜焌렠汁흗렓렜雋孟웃渽렜遠펭웃渽렜瓦슭 111001101011000110000001111011011001110110010111111010111010000010010011111010111010000010011100111001111000010010001100111010111010000010100000111001101011000110000001111011011001110110010111111010111010000010010011111010111010000010011100111010011001101110001011111001011010110110011111111011001001101110000011111001101011100010111101111010111010000010011100111010011000000110100000111011011000111010101101111011001001101110000011111001101011100010111101111010111010000010011100111001111001001110100110111011001000101010101101 e6b181ed9d97eba093eba09ce7848ceba0a0e6b181ed9d97eba093eba09ce99b8be5ad9fec9b83e6b8bdeba09ce981a0ed8eadec9b83e6b8bdeba09ce793a6ec8aad
UHC 汁흗렓렜焌렠汁흗렓렜雋孟웃渽렜遠펭웃渽렜瓦슭 1111000111110000110010001110100110001110101010001000111010101110111100011110000010001110101100011111000111110000110010001110100110001110101010001000111010101110111100011110011011011000111010111011111111110100111011101010101010001110101011101110101011000000110001101110101110111111111101001110111010101010100011101010111011101000101111111011110110111110 f1f0c8e98ea88eaef1e08eb1f1f0c8e98ea88eaef1e6d8ebbff4eeaa8eaeeac0c6ebbff4eeaa8eaee8bfbdbe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)