To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????A 00111111001111110011111100111111001111110011111100111111001111110011111101000001 3f3f3f3f3f3f3f3f3f41
SJIS-WIN 要?????午ワ?A 10010111011101100011111100111111001111110011111100111111100011001101111110000011100011110011111101000001 97763f3f3f3f3f8cdf838f3f41
EUC-JP 要??濚??午ワ?A 110011011101011100111111001111111000111111001001101000010011111100111111101110001110000110100101111011110011111101000001 cdd73f3f8fc9a13f3fb8e1a5ef3f41
UTF-8 要쏉숴濚앾쉴午ワ쉰A 11101000101001101000000111101100100011111000100111101100100010001011010011100110101111111001101011101100100101011011111011101100100010011011010011100101100011011000100011100011100000111010111111101100100010011011000001000001 e8a681ec8f89ec88b4e6bf9aec95beec89b4e58d88e383afec89b041
UHC 要쏉숴濚앾쉴午ワ쉰A 11101001101010011001101111101111101111011010010011100111101110011001110111101111101111011010111111100111111011011010101111101111101111011010111001000001 e9a99befbda4e7b99defbdafe7edabefbdae41

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)