Which bit string is a character (or short string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 辟擾スシ魄難ス「}v辟擾スシ魄難ス「}vB 1110011110000100100011111110111110111101101111001110100110101110100100111110111110111101101000100111110101110110111001111000010010001111111011111011110110111100111010011010111010010011111011111011110110100010011111010111011001000010 e7848fefbdbce9ae93efbda27d76e7848fefbdbce9ae93efbda27d7642
EUC-JP 辟擾スシ魄難ス「}v辟擾スシ魄難ス「}vB 11101101111001001011111011110001100011101011110110001110101111001111001010110000110001101111000110001110101111011000111010100010011111010111011011101101111001001011111011110001100011101011110110001110101111001111001010110000110001101111000110001110101111011000111010100010011111010111011001000010 ede4bef18ebd8ebcf2b0c6f18ebd8ea27d76ede4bef18ebd8ebcf2b0c6f18ebd8ea27d7642
UTF-8 辟擾スシ魄難ス「}v辟擾スシ魄難ス「}vB 1110100010111110100111111110011010010011101111101110111110111101101111011110111110111101101111001110100110101101100001001110100110011011101000111110111110111101101111011110111110111101101000100111110101110110111010001011111010011111111001101001001110111110111011111011110110111101111011111011110110111100111010011010110110000100111010011001101110100011111011111011110110111101111011111011110110100010011111010111011001000010 e8be9fe693beefbdbdefbdbce9ad84e99ba3efbdbdefbda27d76e8be9fe693beefbdbdefbdbce9ad84e99ba3efbdbdefbda27d7642
UHC ?擾??魄難??}v?擾??魄難??}vB 001111111110100011110110001111110011111111011011110111101101000111110001001111110011111101111101011101100011111111101000111101100011111100111111110110111101111011010001111100010011111100111111011111010111011001000010 3fe8f63f3fdbded1f13f3f7d763fe8f63f3fdbded1f13f3f7d7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
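
A conversion like the one in the table above can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_table(text):
    """Return (charset label, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


for label, bits, hex_string in encode_table("あ"):
    print(label, bits, hex_string)
```

An ASCII character such as "B" encodes to the single byte 42 (01000010) in all five charsets, which is why every row of the table ends in 42.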