To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????D??tv??????D??tvB 0011111100111111001111110011111100111111001111110100010000111111001111110111010001110110001111110011111100111111001111110011111100111111010001000011111100111111011101000111011001000010 3f3f3f3f3f3f443f3f74763f3f3f3f3f3f443f3f747642
SJIS-WIN 嶸ゥ嶸ヲ嶸ォD嶸「tv嶸ゥ嶸ヲ嶸ォD嶸「tvB 11111010101101001010100111111010101101001010011011111010101101001010101101000100111110101011010010100010011101000111011011111010101101001010100111111010101101001010011011111010101101001010101101000100111110101011010010100010011101000111011001000010 fab4a9fab4a6fab4ab44fab4a27476fab4a9fab4a6fab4ab44fab4a2747642
EUC-JP 嶸ゥ嶸ヲ嶸ォD嶸「tv嶸ゥ嶸ヲ嶸ォD嶸「tvB 1000111110111011111101001000111010101001100011111011101111110100100011101010011010001111101110111111010010001110101010110100010010001111101110111111010010001110101000100111010001110110100011111011101111110100100011101010100110001111101110111111010010001110101001101000111110111011111101001000111010101011010001001000111110111011111101001000111010100010011101000111011001000010 8fbbf48ea98fbbf48ea68fbbf48eab448fbbf48ea274768fbbf48ea98fbbf48ea68fbbf48eab448fbbf48ea2747642
UTF-8 嶸ゥ嶸ヲ嶸ォD嶸「tv嶸ゥ嶸ヲ嶸ォD嶸「tvB 11100101101101101011100011101111101111011010100111100101101101101011100011101111101111011010011011100101101101101011100011101111101111011010101101000100111001011011011010111000111011111011110110100010011101000111011011100101101101101011100011101111101111011010100111100101101101101011100011101111101111011010011011100101101101101011100011101111101111011010101101000100111001011011011010111000111011111011110110100010011101000111011001000010 e5b6b8efbda9e5b6b8efbda6e5b6b8efbdab44e5b6b8efbda27476e5b6b8efbda9e5b6b8efbda6e5b6b8efbdab44e5b6b8efbda2747642
UHC 嶸?嶸?嶸?D嶸?tv嶸?嶸?嶸?D嶸?tvB 11100111101011100011111111100111101011100011111111100111101011100011111101000100111001111010111000111111011101000111011011100111101011100011111111100111101011100011111111100111101011100011111101000100111001111010111000111111011101000111011001000010 e7ae3fe7ae3fe7ae3f44e7ae3f7476e7ae3fe7ae3fe7ae3f44e7ae3f747642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)