Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 賊?垣亘????寃???賊?垣亘????寃???^ 100100011010111100111111100010100101111110011000011010100011111100111111001111110011111110011011100000110011111100111111001111111001000110101111001111111000101001011111100110000110101000111111001111110011111100111111100110111000001100111111001111110011111101011110 91af3f8a5f986a3f3f3f3f9b833f3f3f91af3f8a5f986a3f3f3f3f9b833f3f3f5e
EUC-JP 賊?垣亘????寃???賊?垣亘????寃???^ 110000101011000100111111101100111100000011001111110010110011111100111111001111110011111111010101111000110011111100111111001111111100001010110001001111111011001111000000110011111100101100111111001111110011111100111111110101011110001100111111001111110011111101011110 c2b13fb3c0cfcb3f3f3f3fd5e33f3f3fc2b13fb3c0cfcb3f3f3f3fd5e33f3f3f5e
UTF-8 賊렠垣亘롛렣欌렪寃닿렢난賊렠垣亘롛렣欌렪寃닿렢난^ 11101000101100111000101011101011101000001010000011100101100111101010001111100100101110101001100011101011101000011001101111101011101000001010001111100110101011001000110011101011101000001010101011100101101011111000001111101011100010111011111111101011101000001010001011101011100000101001110011101000101100111000101011101011101000001010000011100101100111101010001111100100101110101001100011101011101000011001101111101011101000001010001111100110101011001000110011101011101000001010101011100101101011111000001111101011100010111011111111101011101000001010001011101011100000101001110001011110 e8b38aeba0a0e59ea3e4ba98eba19beba0a3e6ac8ceba0aae5af83eb8bbfeba0a2eb829ce8b38aeba0a0e59ea3e4ba98eba19beba0a3e6ac8ceba0aae5af83eb8bbfeba0a2eb829c5e
UHC 賊렠垣亘롛렣欌렪寃닿렢난賊렠垣亘롛렣欌렪寃닿렢난^ 11101110111001001000111010110001111010101010111111010000111001101000111011011111100011101011010011101101111010111000111010111000111010101011001010110100111010101000111010110011101100111010110111101110111001001000111010110001111010101010111111010000111001101000111011011111100011101011010011101101111010111000111010111000111010101011001010110100111010101000111010110011101100111010110101011110 eee48eb1eaafd0e68edf8eb4edeb8eb8eab2b4ea8eb3b3adeee48eb1eaafd0e68edf8eb4edeb8eb8eab2b4ea8eb3b3ad5e
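The rows above can be reproduced approximately with a few lines of Python. This is a minimal sketch, not the converter's actual implementation. The function name `encode_bits` and the codec names are assumptions: `cp932` stands in for SJIS-WIN and `cp949` for UHC in Python's codec registry. Using `errors="replace"` is also an assumption, chosen because the table shows `?` (hex 3f) for characters a charset cannot represent.

```python
def encode_bits(text, charset, errors="replace"):
    """Return (binary string, hex string) for text encoded in charset.

    Characters the charset cannot represent become '?' (0x3f),
    which is what errors="replace" does when encoding.
    """
    raw = text.encode(charset, errors=errors)
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "賊^"
    for label, codec in CHARSETS.items():
        bits, hexa = encode_bits(sample, codec)
        print(label, bits, hexa)
```

For example, `encode_bits("賊", "utf-8")` yields the hex string `e8b38a`, matching the first three bytes of the UTF-8 row.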

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).