To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 遙??節???????ゴ擁o?節??繹??^ 1110101010100001001111110011111110010000110111110011111100111111001111110011111100111111001111110011111110000011010100111001011101101001100000101000111100111111100100001101111100111111001111111110001110001000001111110011111101011110 eaa13f3f90df3f3f3f3f3f3f3f83539769828f3f90df3f3fe3883f3f5e
EUC-JP 遙??節???????ゴ擁o?節??繹??^ 1111010010100011001111110011111111000000111000010011111100111111001111110011111100111111001111110011111110100101101101001100110111001010101000111110111100111111110000001110000100111111001111111110010111101000001111110011111101011110 f4a33f3fc0e13f3f3f3f3f3f3fa5b4cdcaa3ef3fc0e13f3fe5e83f3f5e
UTF-8 遙닺땃節쏙슴料욄볜殮닻ゴ擁o슬節욥솤繹루쎍^ 11101001100000011001100111101011100010111011101011101011100101011000001111100111101011111000000011101100100011111001100111101100100010101011010011101111101001101011111011101100100110101000010011101011101100111001110011101111101001101010010111101011100010111011101111100011100000101011010011100110100100111000000111101111101111011000111111101100100010101010110011100111101011111000000011101100100110101010010111101100100001101010010011100111101110011011100111101011101000111010100011101100100011101000110101011110 e98199eb8bbaeb9583e7af80ec8f99ec8ab4efa6beec9a84ebb39cefa6a5eb8bbbe382b4e69381efbd8fec8aace7af80ec9aa5ec86a4e7b9b9eba3a8ec8e8d5e
UHC 遙닺땃節쏙슴料욄볜殮닻ゴ擁o슬節욥솤繹루쎍^ 11101001101010111011010011101000101101101010001111101111101111011011110111101111101111011011111111101000111101111001111011100110101110101011011111100110111110011011010011101001101010111011010011101000101101101010001111101111101111011011110111101111101111011011111111101001100110011001111011100110101110101011011111100111100110111011010001011110 e9abb4e8b6a3efbdbdefbdbfe8f79ee6bab7e6f9b4e9abb4e8b6a3efbdbdefbdbfe9999ee6bab7e79bb45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)