To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 癲??宜??惟る?冗??癲??宜??惟る?冗??^ 1110000110011111001111110011111110001011010110000011111100111111100010001101001010000010111010010011111110001111111001110011111100111111111000011001111100111111001111111000101101011000001111110011111110001000110100101000001011101001001111111000111111100111001111110011111101011110 e19f3f3f8b583f3f88d282e93f8fe73f3fe19f3f3f8b583f3f88d282e93f8fe73f3f5e
EUC-JP 癲??宜??惟る?冗??癲??宜??惟る?冗??^ 1110001010100001001111110011111110110101101110010011111100111111101100001101010010100100111010110011111110111110111010010011111100111111111000101010000100111111001111111011010110111001001111110011111110110000110101001010010011101011001111111011111011101001001111110011111101011110 e2a13f3fb5b93f3fb0d4a4eb3fbee93f3fe2a13f3fb5b93f3fb0d4a4eb3fbee93f3f5e
UTF-8 癲덈챶宜룬씘惟る쿅冗뱀뼓癲덈챶宜룬씘惟る쿅冗뱀뼓^ 11100111100110011011001011101011100011011000100011101100101100011011011011100101101011101001110011101011101000111010110011101100100101001001100011100110100000111001111111100011100000101000101111101100101111111000010111100101100001101001011111101011101100011000000011101011101111001001001111100111100110011011001011101011100011011000100011101100101100011011011011100101101011101001110011101011101000111010110011101100100101001001100011100110100000111001111111100011100000101000101111101100101111111000010111100101100001101001011111101011101100011000000011101011101111001001001101011110 e799b2eb8d88ecb1b6e5ae9ceba3acec9498e6839fe3828becbf85e58697ebb180ebbc93e799b2eb8d88ecb1b6e5ae9ceba3acec9498e6839fe3828becbf85e58697ebb180ebbc935e
UHC 癲덈챶宜룬씘惟る쿅冗뱀뼓癲덈챶宜룬씘惟る쿅冗뱀뼓^ 11101111101001101000100011101011101010101000001111101011111100011011011111101001100111011010110111101010111011101010101011101011101100101001101011101001101101111011100111101100100101101001101111101111101001101000100011101011101010101000001111101011111100011011011111101001100111011010110111101010111011101010101011101011101100101001101011101001101101111011100111101100100101101001101101011110 efa688ebaa83ebf1b7e99dadeaeeaaebb29ae9b7b9ec969befa688ebaa83ebf1b7e99dadeaeeaaebb29ae9b7b9ec969b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)