To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 臾?П維?Э維?た臾??維?Э維?た^ 111001000110101100111111100001000101000010001000110110110011111110000100010111101000100011011011001111111000001010111101111001000110101100111111001111111000100011011011001111111000010001011110100010001101101100111111100000101011110101011110 e46b3f845088db3f845e88db3f82bde46b3f3f88db3f845e88db3f82bd5e
EUC-JP 臾?П維?Э維?た臾??維?Э維?た^ 111001111100110000111111101001111011000110110000110111010011111110100111101111111011000011011101001111111010010010111111111001111100110000111111001111111011000011011101001111111010011110111111101100001101110100111111101001001011111101011110 e7cc3fa7b1b0dd3fa7bfb0dd3fa4bfe7cc3f3fb0dd3fa7bfb0dd3fa4bf5e
UTF-8 臾곕П維뽯Э維쏅た臾곕춮維볥Э維쏅た^ 11101000100001111011111011101010101100111001010111010000100111111110011110110110101011011110101110111101101011111101000010101101111001111011011010101101111011001000111110000101111000111000000110011111111010001000011110111110111010101011001110010101111011001011011010101110111001111011011010101101111010111011001110100101110100001010110111100111101101101010110111101100100011111000010111100011100000011001111101011110 e887beeab395d09fe7b6adebbdafd0ade7b6adec8f85e3819fe887beeab395ecb6aee7b6adebb3a5d0ade7b6adec8f85e3819f5e
UHC 臾곕П維뽯Э維쏅た臾곕춮維볥Э維쏅た^ 11101011101011001011000011101011101011001011000111101011101010111001011011101011101011001011111111101011101010111001101111101011101010101011111111101011101011001011000011101011101011011000101111101011101010111001001111101011101011001011111111101011101010111001101111101011101010101011111101011110 ebacb0ebacb1ebab96ebacbfebab9bebaabfebacb0ebad8bebab93ebacbfebab9bebaabf5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)