To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 葉??揶?????鴨?=葉??揶?????鴨?=B 100101110111010000111111001111111001110110001000001111110011111100111111001111110011111110001010100110110011111110000001100000011001011101110100001111110011111110011101100010000011111100111111001111110011111100111111100010101001101100111111100000011000000101000010 97743f3f9d883f3f3f3f3f8a9b3f818197743f3f9d883f3f3f3f3f8a9b3f818142
EUC-JP 葉??揶?????鴨?=葉??揶?????鴨?=B 110011011101010100111111001111111101100111101000001111110011111100111111001111110011111110110011111110110011111110100001111000011100110111010101001111110011111111011001111010000011111100111111001111110011111100111111101100111111101100111111101000011110000101000010 cdd53f3fd9e83f3f3f3f3fb3fb3fa1e1cdd53f3fd9e83f3f3f3f3fb3fb3fa1e142
UTF-8 葉뗫젚揶쏄낏溜곕젩鴨앹=葉뗫젚揶쏄낏溜곕젩鴨앹=B 11101000100100011000100111101011100101111010101111101100101000001001101011100110100011111011011011101100100011111000010011101011100000101000111111101111101001111000101111101010101100111001010111101100101000001010100111101001101101001010100011101100100101011011100111101111101111001001110111101000100100011000100111101011100101111010101111101100101000001001101011100110100011111011011011101100100011111000010011101011100000101000111111101111101001111000101111101010101100111001010111101100101000001010100111101001101101001010100011101100100101011011100111101111101111001001110101000010 e89189eb97abeca09ae68fb6ec8f84eb828fefa78beab395eca0a9e9b4a8ec95b9efbc9de89189eb97abeca09ae68fb6ec8f84eb828fefa78beab395eca0a9e9b4a8ec95b9efbc9d42
UHC 葉뗫젚揶쏄낏溜곕젩鴨앹=葉뗫젚揶쏄낏溜곕젩鴨앹=B 11100111101010001000101111101011101000001001011011100101101010101001101111101010101100111010100011101010111111101011000011101011101000001010000111100100111001011001110111101100101000111011110111100111101010001000101111101011101000001001011011100101101010101001101111101010101100111010100011101010111111101011000011101011101000001010000111100100111001011001110111101100101000111011110101000010 e7a88beba096e5aa9beab3a8eafeb0eba0a1e4e59deca3bde7a88beba096e5aa9beab3a8eafeb0eba0a1e4e59deca3bd42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)