To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 倭??節o?罌??饒??碍х?餓??節 10011000011000000011111100111111100100001101111110000010100011110011111111100011101000000011111100111111111010010110000000111111001111111000101001010110100001001000011100111111100010011110110000111111001111111001000011011111 98603f3f90df828f3fe3a03f3fe9603f3f8a5684873f89ec3f3f90df
EUC-JP 倭??節o?罌??饒??碍х?餓??節 11001111110000010011111100111111110000001110000110100011111011110011111111100110101000100011111100111111111100011100000100111111001111111011001110110111101001111110011100111111101100101110111000111111001111111100000011100001 cfc13f3fc0e1a3ef3fe6a23f3ff1c13f3fb3b7a7e73fb2ee3f3fc0e1
UTF-8 倭뽪퍝節o쉘罌욑슝饒묈ㄵ碍х맏餓숋숴節 1110010110000000101011011110101110111101101010101110110110001101100111011110011110101111100000001110111110111101100011111110110010001001100110001110011110111101100011001110110010011010100100011110110010001010100111011110100110100101100100101110101110101100100010001110001110000100101101011110011110100010100011011101000110000101111010111010011110001111111010011010010010010011111011001000100010001011111011001000100010110100111001111010111110000000 e580adebbdaaed8d9de7af80efbd8fec8998e7bd8cec9a91ec8a9de9a592ebac88e384b5e7a28dd185eba78fe9a493ec888bec88b4e7af80
UHC 倭뽪퍝節o쉘罌욑슝饒묈ㄵ碍х맏餓숋숴節 1110100011011110100101101110011010111011100101001110111110111101101000111110111110111101101010011110010110100010100111101110111110111101101110011110100110101110100100011110010110100100101001011110010011110100101011001110011110111000101110101110010010111011100110011110111110111101101001001110111110111101 e8de96e6bb94efbda3efbda9e5a29eefbdb9e9ae91e5a4a5e4f4ace7b8bae4bb99efbda4efbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)