To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^`??????^OB 0011111100111111001111110011111100111111001111110101111001100000001111110011111100111111001111110011111100111111010111100100111101000010 3f3f3f3f3f3f5e603f3f3f3f3f3f5e4f42
SJIS-WIN 蹂シ」蹂シォ^`蹂シ」蹂シォ^OB 111001101111100010111100101000111110011011111000101111001010101101011110011000001110011011111000101111001010001111100110111110001011110010101011010111100100111101000010 e6f8bca3e6f8bcab5e60e6f8bca3e6f8bcab5e4f42
EUC-JP 蹂シ」蹂シォ^`蹂シ」蹂シォ^OB 1110110011111010100011101011110010001110101000111110110011111010100011101011110010001110101010110101111001100000111011001111101010001110101111001000111010100011111011001111101010001110101111001000111010101011010111100100111101000010 ecfa8ebc8ea3ecfa8ebc8eab5e60ecfa8ebc8ea3ecfa8ebc8eab5e4f42
UTF-8 蹂シ」蹂シォ^`蹂シ」蹂シォ^OB 1110100010111001100000101110111110111101101111001110111110111101101000111110100010111001100000101110111110111101101111001110111110111101101010110101111001100000111010001011100110000010111011111011110110111100111011111011110110100011111010001011100110000010111011111011110110111100111011111011110110101011010111100100111101000010 e8b982efbdbcefbda3e8b982efbdbcefbdab5e60e8b982efbdbcefbda3e8b982efbdbcefbdab5e4f42
UHC 蹂??蹂??^`蹂??蹂??^OB 111010111011001100111111001111111110101110110011001111110011111101011110011000001110101110110011001111110011111111101011101100110011111100111111010111100100111101000010 ebb33f3febb33f3f5e60ebb33f3febb33f3f5e4f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)