To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 諸???諸???n}諸???諸???n{^ 10001111100101000011111100111111001111111000111110010100001111110011111100111111011011100111110110001111100101000011111100111111001111111000111110010100001111110011111100111111011011100111101101011110 8f943f3f3f8f943f3f3f6e7d8f943f3f3f8f943f3f3f6e7b5e
EUC-JP 諸?頊?諸?頊?n}諸?頊?諸?頊?n{^ 101111011111010000111111100011111110011111110100001111111011110111110100001111111000111111100111111101000011111101101110011111011011110111110100001111111000111111100111111101000011111110111101111101000011111110001111111001111111010000111111011011100111101101011110 bdf43f8fe7f43fbdf43f8fe7f43f6e7dbdf43f8fe7f43fbdf43f8fe7f43f6e7b5e
UTF-8 諸렪頊타諸렪頊큰n}諸렪頊타諸렪頊큰n{^ 1110100010101011101110001110101110100000101010101110100110100000100010101110110110000011100000001110100010101011101110001110101110100000101010101110100110100000100010101110110110000001101100000110111001111101111010001010101110111000111010111010000010101010111010011010000010001010111011011000001110000000111010001010101110111000111010111010000010101010111010011010000010001010111011011000000110110000011011100111101101011110 e8abb8eba0aae9a08aed8380e8abb8eba0aae9a08aed81b06e7de8abb8eba0aae9a08aed8380e8abb8eba0aae9a08aed81b06e7b5e
UHC 諸렪頊타諸렪頊큰n}諸렪頊타諸렪頊큰n{^ 11110000101100111000111010111000111010011111010111000101101110001111000010110011100011101011100011101001111101011100010110101011011011100111110111110000101100111000111010111000111010011111010111000101101110001111000010110011100011101011100011101001111101011100010110101011011011100111101101011110 f0b38eb8e9f5c5b8f0b38eb8e9f5c5ab6e7df0b38eb8e9f5c5b8f0b38eb8e9f5c5ab6e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
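
A conversion like the one in the table can be approximated in Python. This is only a sketch and not the converter's actual implementation. The codec names in `CODECS` are assumed equivalents of the table's charset labels (for example `cp932` for SJIS-WIN and `cp949` for UHC), `encode_row` is a hypothetical helper, and `errors="replace"` is used to mimic the way characters with no mapping show up as `?` (0x3f). Codec tables can differ between implementations, so some rows may not match byte for byte.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, charset):
    """Return (binary string, hex string) for `text` in `charset`.

    Characters the charset cannot represent are replaced with '?'.
    """
    data = text.encode(CODECS[charset], errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    # Print one line per charset: label, bit string, hex string.
    for charset in CODECS:
        bits, hex_string = encode_row("n}", charset)
        print(charset, bits, hex_string)
```

For instance, `encode_row("諸", "SJIS-WIN")` yields the hex string `8f94`, which is the two-byte prefix visible in the SJIS-WIN row above.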