To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????F}v???????F}vB 001111110011111100111111001111110011111100111111001111110100011001111101011101100011111100111111001111110011111100111111001111110011111101000110011111010111011001000010 3f3f3f3f3f3f3f467d763f3f3f3f3f3f3f467d7642
SJIS-WIN 鴉フ諶ソ治柴軸F}v鴉フ諶ソ治柴軸F}vB 11101001111010111100110011111011101010101011111110001110101000011000111011000100100011101011001001000110011111010111011011101001111010111100110011111011101010101011111110001110101000011000111011000100100011101011001001000110011111010111011001000010 e9ebccfbaabf8ea18ec48eb2467d76e9ebccfbaabf8ea18ec48eb2467d7642
EUC-JP 鴉フ諶ソ治柴軸F}v鴉フ諶ソ治柴軸F}vB 11110010111011011000111011001100100011111101111010110101100011101011111110111100101000111011110011000110101111001011010001000110011111010111011011110010111011011000111011001100100011111101111010110101100011101011111110111100101000111011110011000110101111001011010001000110011111010111011001000010 f2ed8ecc8fdeb58ebfbca3bcc6bcb4467d76f2ed8ecc8fdeb58ebfbca3bcc6bcb4467d7642
UTF-8 鴉フ諶ソ治柴軸F}v鴉フ諶ソ治柴軸F}vB 11101001101101001000100111101111101111101000110011101000101010111011011011101111101111011011111111100110101100101011101111100110100111111011010011101000101110111011100001000110011111010111011011101001101101001000100111101111101111101000110011101000101010111011011011101111101111011011111111100110101100101011101111100110100111111011010011101000101110111011100001000110011111010111011001000010 e9b489efbe8ce8abb6efbdbfe6b2bbe69fb4e8bbb8467d76e9b489efbe8ce8abb6efbdbfe6b2bbe69fb4e8bbb8467d7642
UHC 鴉?諶?治柴軸F}v鴉?諶?治柴軸F}vB 11100100101111000011111111100100101001100011111111110110101111011110001111000011111101011110111001000110011111010111011011100100101111000011111111100100101001100011111111110110101111011110001111000011111101011110111001000110011111010111011001000010 e4bc3fe4a63ff6bde3c3f5ee467d76e4bc3fe4a63ff6bde3c3f5ee467d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)