To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲????????巍ル?維???k??l?釗 1110000110011111001111110011111100111111001111110011111100111111001111110011111110011011110110011000001110001011001111111000100011011011001111110011111100111111100000101000101100111111001111111000001010001100001111111111101110111011 e19f3f3f3f3f3f3f3f3f9bd9838b3f88db3f3f3f828b3f3f828c3ffbbb
EUC-JP 癲????????巍ル?維??洹k??l?釗 1110001010100001001111110011111100111111001111110011111100111111001111110011111111010110110110111010010111101011001111111011000011011101001111110011111110001111110001111011101010100011111010110011111100111111101000111110110000111111100011111110001110100110 e2a13f3f3f3f3f3f3f3fd6dba5eb3fb0dd3f3f8fc7baa3eb3f3fa3ec3f8fe3a6
UTF-8 癲욌돂留뚩굜戮⑷뭍巍ル쵑維붹넼洹k탟力l빢釗 111001111001100110110010111011001001101010001100111010111000111110000010111011111010011110001101111010111001101010101001111010101011010110011100111011111010011110010010111000101001000110110111111010111010110110001101111001011011011110001101111000111000001110101011111011001011010110010001111001111011011010101101111010111011011010111001111010111000010010111100111001101011010010111001111011111011110110001011111011011000001110011111111011111010011010001010111011111011110110001100111010111011100110100010111010011000011110010111 e799b2ec9a8ceb8f82efa78deb9aa9eab59cefa792e291b7ebad8de5b78de383abecb591e7b6adebb6b9eb84bce6b4b9efbd8bed839fefa68aefbd8cebb9a2e98797
UHC 癲욌돂留뚩굜戮⑷뭍巍ル쵑維붹넼洹k탟力l빢釗 1110111110100110100111101110101110001001100101011110101110100111100011001110100010000010100001001110101110111101101010011110101010111001101101111110100011100100101010111110101110101100100100111110101110101011100101001110011010000110101101101110101010110111101000111110101110110101100000111110011010110011101000111110110010010101101111101110000111110010 efa69eeb8995eba78ce88284ebbda9eab9b7e8e4abebac93ebab94e686b6eab7a3ebb583e6b3a3ec95bee1f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)