To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 淨?低頭?屯??瀞??製?低?儀???紆? 100111111100010000111111100100101110000110010011101010100011111110010011110101000011111100111111100100111101001000111111001111111001000010111011001111111001001011100001001111111000101101010110001111110011111100111111111000101111110000111111 9fc43f92e193aa3f93d43f3f93d23f3f90bb3f92e13f8b563f3f3fe2fc3f
EUC-JP 淨?低頭?屯??瀞?汶製?低?儀???紆? 1101111011000110001111111100010011100011110001101010110000111111110001101101011000111111001111111100011011010100001111111000111111000110111001011100000010111101001111111100010011100011001111111011010110110111001111110011111100111111111001001111111000111111 dec63fc4e3c6ac3fc6d63f3fc6d43f8fc6e5c0bd3fc4e33fb5b73f3f3fe4fe3f
UTF-8 淨렠低頭렧屯렟렩瀞펨汶製렩低렮儀브렕렟紆렣 111001101011011110101000111010111010000010100000111001001011110110001110111010011010000010101101111010111010000010100111111001011011000110101111111010111010000010011111111010111010000010101001111001111000000010011110111011011000111010101000111001101011000110110110111010001010001110111101111010111010000010101001111001001011110110001110111010111010000010101110111001011000010010000000111010111011100010001100111010111010000010010101111010111010000010011111111001111011010010000110111010111010000010100011 e6b7a8eba0a0e4bd8ee9a0adeba0a7e5b1afeba09feba0a9e7809eed8ea8e6b1b6e8a3bdeba0a9e4bd8eeba0aee58480ebb88ceba095eba09fe7b486eba0a3
UHC 淨렠低頭렧屯렟렩瀞펨汶製렩低렮儀브렕렟紆렣 111011111110010010001110101100011110111010111000110101001110100110001110101101101101010011101010100011101011000010001110101101111110111111100111110001101110100011011010101000011111000010110010100011101011011111101110101110001000111010111011111010111111000010111010111010101000111010101010100011101011000011101001111000011000111010110100 efe48eb1eeb8d4e98eb6d4ea8eb08eb7efe7c6e8daa1f0b28eb7eeb88ebbebf0baea8eaa8eb0e9e18eb4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)