To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋や???オ餓??渦??押?オ???也よ? 100110101100100010000010111000100011111100111111001111111000001101001001100010011110110000111111001111111000100101010001001111110011111110001001100111110011111110000011010010010011111100111111001111111001011011100111100000101110011000111111 9ac882e23f3f3f834989ec3f3f89513f3f899f3f83493f3f3f96e782e63f
EUC-JP 塋や???オ餓??渦??押?オ???也よ? 110101001100101010100100111001000011111100111111001111111010010110101010101100101110111000111111001111111011000110110010001111110011111110110010101000010011111110100101101010100011111100111111001111111100110011101001101001001110100000111111 d4caa4e43f3f3fa5aab2ee3f3fb1b23f3fb2a13fa5aa3f3f3fcce9a4e83f
UTF-8 塋や퍟掠욅オ餓뽩콪渦곈걫押띄オ掠욄룂也よ퀕 111001011010000110001011111000111000001010000100111011011000110110011111111011111010010110110101111011001001101010000101111000111000001010101010111010011010010010010011111010111011110110101001111011001011110110101010111001101011100010100110111010101011001110001000111010101011000110101011111001101000101010111100111010111001110110000100111000111000001010101010111011111010010110110101111011001001101010000100111010111010001110000010111001001011100110011111111000111000001010001000111011011000000010010101 e5a18be38284ed8d9fefa5b5ec9a85e382aae9a493ebbda9ecbdaae6b8a6eab388eab1abe68abceb9d84e382aaefa5b5ec9a84eba382e4b99fe38288ed8095
UHC 塋や퍟掠욅オ餓뽩콪渦곈걫押띄オ掠욄룂也よ퀕 111001111010101110101010111001001011101110010110111001011011000110011110111001111010101110101010111001001011101110010110111001011011000110011110111010001011111010110000111010011000000110010100111001001110001110110110111001111010101110101010111001011011000110011110111001101000111110000011111001011010010110101010111010001011001110001010 e7abaae4bb96e5b19ee7abaae4bb96e5b19ee8beb0e98194e4e3b6e7abaae5b19ee68f83e5a5aae8b38a

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
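
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function names `encode_row` and `convert` and the `CHARSETS` mapping are made up for illustration, and the codec names are assumptions about which Python codecs correspond to the charsets above (`cp932` for SJIS-win, `cp949` for UHC). Characters that a charset cannot represent are replaced with "?" (0x3f), which is the pattern visible in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code = CP949
}


def encode_row(text, codec):
    """Return (bit string, hex string) for text encoded with the given codec."""
    # Characters the codec cannot represent are replaced with "?" (0x3f).
    data = text.encode(codec, errors="replace")
    bit_string = "".join(f"{byte:08b}" for byte in data)
    return bit_string, data.hex()


def convert(text):
    """Encode text in every charset and return {charset label: (bits, hex)}."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("オ").items():
        print(name, bits, hex_string)
```

For the input "オ", this sketch gives e382aa in UTF-8, 8349 in SJIS-win and a5aa in EUC-JP, the same byte sequences that appear for that character in the rows above.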