To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鈔ア螂ェ螟夊ーキ蓁ウ蜊ウ譽壽ュ主夋霎ソ 111001111110001010110001111001011010010110101010111001011010010010011010111010001011000010110111111001001110101110110011111001011000110110110011111001101010001110011010111001101010110110001110111001011111101010011111111010001011111010111111 e7e2b1e5a5aae5a49ae8b0b7e4ebb3e58db3e6a39ae6ad8ee5fa9fe8bebf
EUC-JP 鈔ア螂ェ螟夊ーキ蓁ウ蜊ウ譽壽ュ主夋霎ソ 111011101110010010001110101100011110101010100111100011101010101011101010101001101101010011101010100011101011000010001110101101111110100011101101100011101011001111101001111011011000111010110011111011001010010111010100111010001000111010101101101111001110011110001111101110001110000111110000110000001000111010111111 eee48eb1eaa78eaaeaa6d4ea8eb08eb7e8ed8eb3e9ed8eb3eca5d4e88eadbce78fb8e1f0c08ebf
UTF-8 鈔ア螂ェ螟夊ーキ蓁ウ蜊ウ譽壽ュ主夋霎ソ 111010011000100010010100111011111011110110110001111010001001111010000010111011111011110110101010111010001001111010011111111001011010010010001010111011111011110110110000111011111011110110110111111010001001001110000001111011111011110110110011111010001001110010001010111011111011110110110011111010001010110110111101111001011010001110111101111011111011110110101101111001001011100010111011111001011010010010001011111010011001110010001110111011111011110110111111 e98894efbdb1e89e82efbdaae89e9fe5a48aefbdb0efbdb7e89381efbdb3e89c8aefbdb3e8adbde5a3bdefbdade4b8bbe5a48be99c8eefbdbf
UHC ??螂?螟???????譽壽?主??? 001111110011111111010101110011000011111111011001101011010011111100111111001111110011111100111111001111110011111111100111111000101110000111111000001111111111000110101011001111110011111100111111 3f3fd5cc3fd9ad3f3f3f3f3f3f3fe7e2e1f83ff1ab3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)