Which bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 曜?????儒??}v曜?????儒??}vB 100101110110101000111111001111110011111100111111001111111000111011110010001111110011111101111101011101101001011101101010001111110011111100111111001111110011111110001110111100100011111100111111011111010111011001000010 976a3f3f3f3f3f8ef23f3f7d76976a3f3f3f3f3f8ef23f3f7d7642
EUC-JP 曜?????儒??}v曜?????儒??}vB 110011011100101100111111001111110011111100111111001111111011110011110100001111110011111101111101011101101100110111001011001111110011111100111111001111110011111110111100111101000011111100111111011111010111011001000010 cdcb3f3f3f3f3fbcf43f3f7d76cdcb3f3f3f3f3fbcf43f3f7d7642
UTF-8 曜섎뀿留롩슆儒듬쭖}v曜섎뀿留롩슆儒듬쭖}vB 1110011010011011100111001110110010000100100011101110101110000000101111111110111110100111100011011110101110100001101010011110110010001010100001101110010110000100100100101110101110010011101011001110110010101101100101100111110101110110111001101001101110011100111011001000010010001110111010111000000010111111111011111010011110001101111010111010000110101001111011001000101010000110111001011000010010010010111010111001001110101100111011001010110110010110011111010111011001000010 e69b9cec848eeb80bfefa78deba1a9ec8a86e58492eb93acecad967d76e69b9cec848eeb80bfefa78deba1a9ec8a86e58492eb93acecad967d7642
UHC 曜섎뀿留롩슆儒듬쭖}v曜섎뀿留롩슆儒듬쭖}vB 1110100011111000100110001110101110000101101101011110101110100111100011101110100110011010100110001110101011100011101101011110101110100111100011100111110101110110111010001111100010011000111010111000010110110101111010111010011110001110111010011001101010011000111010101110001110110101111010111010011110001110011111010111011001000010 e8f898eb85b5eba78ee99a98eae3b5eba78e7d76e8f898eb85b5eba78ee99a98eae3b5eba78e7d7642

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
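
The conversion shown in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the function names `to_bit_strings` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption that may differ from the page's converter for a few vendor-specific characters. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the `3f` runs in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("曜").items():
        print(label, binary, hexadecimal)
```

Padding each byte to eight bits keeps the binary column aligned with the hexadecimal column, so every two hex digits correspond to exactly eight binary digits.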