To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 辰損族炭捉即炭臓村辰損族炭捉即炭臓村B 10010010010000111001000110111001100100011011000010010010010110011001000110101000100100011010011010010010010110011001000110011111100100011011101010010010010000111001000110111001100100011011000010010010010110011001000110101000100100011010011010010010010110011001000110011111100100011011101001000010 924391b991b0925991a891a69259919f91ba924391b991b0925991a891a69259919f91ba42
EUC-JP 辰損族炭捉即炭臓村辰損族炭捉即炭臓村B 11000011101001001100001010111011110000101011001011000011101110101100001010101010110000101010100011000011101110101100001010100001110000101011110011000011101001001100001010111011110000101011001011000011101110101100001010101010110000101010100011000011101110101100001010100001110000101011110001000010 c3a4c2bbc2b2c3bac2aac2a8c3bac2a1c2bcc3a4c2bbc2b2c3bac2aac2a8c3bac2a1c2bc42
UTF-8 辰損族炭捉即炭臓村辰損族炭捉即炭臓村B 11101000101111101011000011100110100100001000110111100110100101111000111111100111100000101010110111100110100011011000100111100101100011011011001111100111100000101010110111101000100001111001001111100110100111011001000111101000101111101011000011100110100100001000110111100110100101111000111111100111100000101010110111100110100011011000100111100101100011011011001111100111100000101010110111101000100001111001001111100110100111011001000101000010 e8beb0e6908de6978fe782ade68d89e58db3e782ade88793e69d91e8beb0e6908de6978fe782ade68d89e58db3e782ade88793e69d9142
UHC 辰損族炭捉?炭?村辰損族炭捉?炭?村B 111100101110001111100001110111111111000011101001111101111010100111110011101101010011111111110111101010010011111111110101101111011111001011100011111000011101111111110000111010011111011110101001111100111011010100111111111101111010100100111111111101011011110101000010 f2e3e1dff0e9f7a9f3b53ff7a93ff5bdf2e3e1dff0e9f7a9f3b53ff7a93ff5bd42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)