To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???韋??銀??語⑤?松??????? 00111111001111110011111111101000111010000011111100111111100010111110001000111111001111111000110011101010100001110100010000111111100011111011110000111111001111110011111100111111001111110011111100111111 3f3f3fe8e83f3f8be23f3f8cea87443f8fbc3f3f3f3f3f3f3f
EUC-JP ???韋??銀??語??松??????? 001111110011111100111111111100001110101000111111001111111011011011100100001111110011111110111000111011000011111100111111101111101011111000111111001111110011111100111111001111110011111100111111 3f3f3ff0ea3f3fb6e43f3fb8ec3f3fbebe3f3f3f3f3f3f3f
UTF-8 僚녹뼔韋귛푻銀㏓븶語⑤줎松쎌춻列룸챷璘쬌 111011111010011010111011111010111000010110111001111010111011110010010100111010011001111110001011111010101011011110011011111011011001000110111011111010011000101010000000111000111000111110010011111010111011100010110110111010001010101010011110111000101001000110100100111011001010010010001110111001101001110110111110111011001000111010001100111011001011011010111011111011111010011010011100111010111010001110111000111011001011000110110111111011111010011110101111111011001010110010001100 efa6bbeb85b9ebbc94e99f8beab79bed91bbe98a80e38f93ebb8b6e8aa9ee291a4eca48ee69dbeec8e8cecb6bbefa69ceba3b8ecb1b7efa7afecac8c
UHC 僚녹뼔韋귛푻銀㏓븶語⑤줎松쎌춻列룸챷璘쬌 11101000111010001011001111101100100101101001110011101010110111111000001011100101101111101000011111101011110111101010011111101011100101011001111111100101110111101010100011101011101000011010000011100001111001101011110111101100101011011001011111100110111010101011011111101011101010101000010011101100110111101010011101000010 e8e8b3ec969ceadf82e5be87ebdea7eb959fe5dea8eba1a0e1e6bdecad97e6eab7ebaa84ecdea742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)