What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????F}v???????F}vB 001111110011111100111111001111110011111100111111001111110100011001111101011101100011111100111111001111110011111100111111001111110011111101000110011111010111011001000010 3f3f3f3f3f3f3f467d763f3f3f3f3f3f3f467d7642
SJIS-WIN 齏フ諶ソ治柴軸F}v齏フ諶ソ治柴軸F}vB 11101000111010111100110011111011101010101011111110001110101000011000111011000100100011101011001001000110011111010111011011101000111010111100110011111011101010101011111110001110101000011000111011000100100011101011001001000110011111010111011001000010 e8ebccfbaabf8ea18ec48eb2467d76e8ebccfbaabf8ea18ec48eb2467d7642
EUC-JP 齏フ諶ソ治柴軸F}v齏フ諶ソ治柴軸F}vB 11110000111011011000111011001100100011111101111010110101100011101011111110111100101000111011110011000110101111001011010001000110011111010111011011110000111011011000111011001100100011111101111010110101100011101011111110111100101000111011110011000110101111001011010001000110011111010111011001000010 f0ed8ecc8fdeb58ebfbca3bcc6bcb4467d76f0ed8ecc8fdeb58ebfbca3bcc6bcb4467d7642
UTF-8 齏フ諶ソ治柴軸F}v齏フ諶ソ治柴軸F}vB 11101001101111011000111111101111101111101000110011101000101010111011011011101111101111011011111111100110101100101011101111100110100111111011010011101000101110111011100001000110011111010111011011101001101111011000111111101111101111101000110011101000101010111011011011101111101111011011111111100110101100101011101111100110100111111011010011101000101110111011100001000110011111010111011001000010 e9bd8fefbe8ce8abb6efbdbfe6b2bbe69fb4e8bbb8467d76e9bd8fefbe8ce8abb6efbdbfe6b2bbe69fb4e8bbb8467d7642
UHC ??諶?治柴軸F}v??諶?治柴軸F}vB 0011111100111111111001001010011000111111111101101011110111100011110000111111010111101110010001100111110101110110001111110011111111100100101001100011111111110110101111011110001111000011111101011110111001000110011111010111011001000010 3f3fe4a63ff6bde3c3f5ee467d763f3fe4a63ff6bde3c3f5ee467d7642
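
A table like the one above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the helper name `encode_table` and the mapping from the labels used here to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions. Python's `cp932` and `euc_jp` codecs may differ from this page's SJIS-WIN and EUC-JP for some vendor-extension characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the "?" entries in the table.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


for label, binary, hex_string in encode_table("治F"):
    print(label, binary, hex_string)
```

Each byte is printed as eight binary digits, so the binary column is always exactly four times as long as the hexadecimal column.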

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).