To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 塢????????鴨↑?節??躍??傲??^ 10011010110001110011111100111111001111110011111100111111001111110011111100111111100010101001101110000001101010100011111110010000110111110011111100111111100101101111010000111111001111111001100011111100001111110011111101011110 9ac73f3f3f3f3f3f3f3f8a9b81aa3f90df3f3f96f43f3f98fc3f3f5e
EUC-JP 塢??縕??旿??鴨↑?節??躍??傲??^ 1101010011001001001111110011111110001111110101001100001000111111001111111000111111000001111101000011111100111111101100111111101110100010101011000011111111000000111000010011111100111111110011001111011000111111001111111101000011111110001111110011111101011110 d4c93f3f8fd4c23f3f8fc1f43f3fb3fba2ac3fc0e13f3fccf63f3fd0fe3f3f5e
UTF-8 塢곻슁縕됧막旿⑵럦鴨↑뱚節녘쾳躍룩쑊傲됪쑍^ 11100101101000011010001011101010101100111011101111101100100010101000000111100111101110001001010111101011100100001010011111101011101001111000100111100110100101111011111111100010100100011011010111101011100111111010011011101001101101001010100011100010100001101001000111101011101100011001101011100111101011111000000011101011100001011001100011101100101111101011001111101000101110101000110111101011101000111010100111101100100100011000101011100101100000101011001011101011100100001010101011101100100100011000110101011110 e5a1a2eab3bbec8a81e7b895eb90a7eba789e697bfe291b5eb9fa6e9b4a8e28691ebb19ae7af80eb8598ecbeb3e8ba8deba3a9ec918ae582b2eb90aaec918d5e
UHC 塢곻슁縕됧막旿⑵럦鴨↑뱚節녘쾳躍룩쑊傲됪쑍^ 11100111111100011000000111101111101111011011001111101000101100101000100111100101101110001011011111100111111110101010100111101000100011101000100111100100111001011010000111101000100100111000000111101111101111011011001111101000101100101000100111100101101110001011011111101000100111001010100111100111111011001000100111100110100111001010110001011110 e7f181efbdb3e8b289e5b8b7e7faa9e88e89e4e5a1e89381efbdb3e8b289e5b8b7e89ca9e7ec89e69cac5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)