To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 爰?除漑爰?除改N}爰?除漑爰?除改N{^ 111000001010011100111111100011111001110010011111111100101110000010100111001111111000111110011100100010011111110001001110011111011110000010100111001111111000111110011100100111111111001011100000101001110011111110001111100111001000100111111100010011100111101101011110 e0a73f8f9c9ff2e0a73f8f9c89fc4e7de0a73f8f9c9ff2e0a73f8f9c89fc4e7b5e
EUC-JP 爰?除漑爰?除改N}爰?除漑爰?除改N{^ 111000001010100100111111101111011111110011011110111101001110000010101001001111111011110111111100101100101111111001001110011111011110000010101001001111111011110111111100110111101111010011100000101010010011111110111101111111001011001011111110010011100111101101011110 e0a93fbdfcdef4e0a93fbdfcb2fe4e7de0a93fbdfcdef4e0a93fbdfcb2fe4e7b5e
UTF-8 爰렚除漑爰렚除改N}爰렚除漑爰렚除改N{^ 1110011110001000101100001110101110100000100110101110100110011001101001001110011010111100100100011110011110001000101100001110101110100000100110101110100110011001101001001110011010010100101110010100111001111101111001111000100010110000111010111010000010011010111010011001100110100100111001101011110010010001111001111000100010110000111010111010000010011010111010011001100110100100111001101001010010111001010011100111101101011110 e788b0eba09ae999a4e6bc91e788b0eba09ae999a4e694b94e7de788b0eba09ae999a4e6bc91e788b0eba09ae999a4e694b94e7b5e
UHC 爰렚除漑爰렚除改N}爰렚除漑爰렚除改N{^ 11101010101110101000111010101101111100001011011011001011110010011110101010111010100011101010110111110000101101101100101111000111010011100111110111101010101110101000111010101101111100001011011011001011110010011110101010111010100011101010110111110000101101101100101111000111010011100111101101011110 eaba8eadf0b6cbc9eaba8eadf0b6cbc74e7deaba8eadf0b6cbc9eaba8eadf0b6cbc74e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)