What bit string is a character (or a short string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 悟??????誼??節??厄β?爰??誘μ? 100011001110010100111111001111110011111100111111001111110011111110001011011000100011111100111111100100001101111100111111001111111001011011101111100000111100000000111111111000001010011100111111001111111001011101010101100000111100101000111111 8ce53f3f3f3f3f3f8b623f3f90df3f3f96ef83c03fe0a73f3f975583ca3f
EUC-JP 悟??佾??瓘誼??節??厄β?爰??誘μ? 10111000111001110011111100111111100011111011000011111011001111110011111110001111110011001110111110110101110000110011111100111111110000001110000100111111001111111100110011110001101001101100001000111111111000001010100100111111001111111100110110110110101001101100110000111111 b8e73f3f8fb0fb3f3f8fccefb5c33f3fc0e13f3fccf1a6c23fe0a93f3fcdb6a6cc3f
UTF-8 悟귣쓷佾딀뤃瓘誼붹에節뗮떐厄β뼯爰껆쳥誘μ땡 11100110100000101001111111101010101101111010001111101100100100111011011111100100101111011011111011101011100101001000000011101011101001001000001111100111100100111001100011101000101010101011110011101011101101101011100111101100100101111001000011100111101011111000000011101011100101111010111011101011100101101001000011100101100011101000010011001110101100101110101110111100101011111110011110001000101100001110101010111011100001101110110010110011101001011110100010101010100110001100111010111100111010111001010110100001 e6829feab7a3ec93b7e4bdbeeb9480eba483e79398e8aabcebb6b9ec9790e7af80eb97aeeb9690e58e84ceb2ebbcafe788b0eabb86ecb3a5e8aa98cebceb95a1
UHC 悟귣쓷佾딀뤃瓘誼붹에節뗮떐厄β뼯爰껆쳥誘μ땡 1110011111110110100000101110101110011101100101001110110011101011100010101110011010001111101101001100111010110110111010111111111010010100111001101011111110100001111011111011110110001011111011011000101110100110111001001111100010100101111000101001011010110010111010101011101010000011111001111010101110001010111010111010111110100101111011001011011010101111 e7f682eb9d94eceb8ae68fb4ceb6ebfe94e6bfa1efbd8bed8ba6e4f8a5e296b2eaba83e7ab8aebafa5ecb6af

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
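
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper name `encode_table` and the mapping from the table's charset labels to Python codec names (for example, `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the `3f` bytes in the table.

```python
# Hypothetical helper (not this page's implementation): encode a string in
# several charsets and return the binary and hexadecimal bit strings.

# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("悟").items():
        print(label, binary, hexadecimal)
```

For the single character 悟, this sketch yields `e6829f` for UTF-8, `8ce5` for SJIS-WIN, and `b8e7` for EUC-JP, matching the leading bytes of the corresponding rows above, and `3f` for ISO-8859-1, which has no code for that character.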