To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 貉ソ驕ョ貉ソ謔噂}v貉ソ驕ョ貉ソ謔噂}vB 11100110101110011011111111101001100000011010111011100110101110011011111111100110100000101000100101011100011111010111011011100110101110011011111111101001100000011010111011100110101110011011111111100110100000101000100101011100011111010111011001000010 e6b9bfe981aee6b9bfe682895c7d76e6b9bfe981aee6b9bfe682895c7d7642
EUC-JP 貉ソ驕ョ貉ソ謔噂}v貉ソ驕ョ貉ソ謔噂}vB 11101100101110111000111010111111111100011110000110001110101011101110110010111011100011101011111111101011111000101011000110111101011111010111011011101100101110111000111010111111111100011110000110001110101011101110110010111011100011101011111111101011111000101011000110111101011111010111011001000010 ecbb8ebff1e18eaeecbb8ebfebe2b1bd7d76ecbb8ebff1e18eaeecbb8ebfebe2b1bd7d7642
UTF-8 貉ソ驕ョ貉ソ謔噂}v貉ソ驕ョ貉ソ謔噂}vB 1110100010110010100010011110111110111101101111111110100110101001100101011110111110111101101011101110100010110010100010011110111110111101101111111110100010101100100101001110010110011001100000100111110101110110111010001011001010001001111011111011110110111111111010011010100110010101111011111011110110101110111010001011001010001001111011111011110110111111111010001010110010010100111001011001100110000010011111010111011001000010 e8b289efbdbfe9a995efbdaee8b289efbdbfe8ac94e599827d76e8b289efbdbfe9a995efbdaee8b289efbdbfe8ac94e599827d7642
UHC ??驕???謔?}v??驕???謔?}vB 00111111001111111100111011110110001111110011111100111111111110011100110000111111011111010111011000111111001111111100111011110110001111110011111100111111111110011100110000111111011111010111011001000010 3f3fcef63f3f3ff9cc3f7d763f3fcef63f3f3ff9cc3f7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)