To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蒸基??甑???咀???蒸基??甑???咀???^ 100011111111011010001010111011100011111100111111100011011001100100111111001111110011111110011001111100000011111100111111001111111000111111110110100010101110111000111111001111111000110110011001001111110011111100111111100110011111000000111111001111110011111101011110 8ff68aee3f3f8d993f3f3f99f03f3f3f8ff68aee3f3f8d993f3f3f99f03f3f3f5e
EUC-JP 蒸基??甑???咀?橒?蒸基??甑???咀?橒?^ 10111110111110001011010011110000001111110011111110111001111110010011111100111111001111111101001011110010001111111000111111000101101011010011111110111110111110001011010011110000001111110011111110111001111110010011111100111111001111111101001011110010001111111000111111000101101011010011111101011110 bef8b4f03f3fb9f93f3f3fd2f23f8fc5ad3fbef8b4f03f3fb9f93f3f3fd2f23f8fc5ad3f5e
UTF-8 蒸基렰렚甑비렰렑咀렭橒렡蒸基렰렚甑비렰렑咀렭橒렣^ 11101000100100101011100011100101100111111011101011101011101000001011000011101011101000001001101011100111100101001001000111101011101110011000010011101011101000001011000011101011101000001001000111100101100100101000000011101011101000001010110111100110101010011001001011101011101000001010000111101000100100101011100011100101100111111011101011101011101000001011000011101011101000001001101011100111100101001001000111101011101110011000010011101011101000001011000011101011101000001001000111100101100100101000000011101011101000001010110111100110101010011001001011101011101000001010001101011110 e892b8e59fbaeba0b0eba09ae79491ebb984eba0b0eba091e59280eba0ade6a992eba0a1e892b8e59fbaeba0b0eba09ae79491ebb984eba0b0eba091e59280eba0ade6a992eba0a35e
UHC 蒸基렰렚甑비렰렑咀렭橒렡蒸基렰렚甑비렰렑咀렭橒렣^ 11110001111110101101000011110001100011101011110110001110101011011111000111110111101110101111000110001110101111011000111010100110111011101011101010001110101110101110100111111000100011101011001011110001111110101101000011110001100011101011110110001110101011011111000111110111101110101111000110001110101111011000111010100110111011101011101010001110101110101110100111111000100011101011010001011110 f1fad0f18ebd8eadf1f7baf18ebd8ea6eeba8ebae9f88eb2f1fad0f18ebd8eadf1f7baf18ebd8ea6eeba8ebae9f88eb45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)