To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 曜??要??獰↓?節??檍??矮??節 100101110110101000111111001111111001011101110110001111110011111111100000110101101000000110101011001111111001000011011111001111110011111110011110111110000011111100111111111000011110001000111111001111111001000011011111 976a3f3f97763f3fe0d681ab3f90df3f3f9ef83f3fe1e23f3f90df
EUC-JP 曜??要??獰↓?節??檍??矮??節 110011011100101100111111001111111100110111010111001111110011111111100000110110001010001010101101001111111100000011100001001111110011111111011100111110100011111100111111111000101110010000111111001111111100000011100001 cdcb3f3fcdd73f3fe0d8a2ad3fc0e13f3fdcfa3f3fe2e43f3fc0e1
UTF-8 曜깍쉼要뺞눟獰↓슧節곈옖檍놅슈矮곻쉠節 111001101001101110011100111010101011100110001101111011001000100110111100111010001010011010000001111010111011101010011110111010111000100010011111111001111000110110110000111000101000011010010011111011001000101010100111111001111010111110000000111010101011001110001000111011001001100010010110111001101010101010001101111010111000011010000101111011001000101010001000111001111001111110101110111010101011001110111011111011001000100110100000111001111010111110000000 e69b9ceab98dec89bce8a681ebba9eeb889fe78db0e28693ec8aa7e7af80eab388ec9896e6aa8deb8685ec8a88e79faeeab3bbec89a0e7af80
UHC 曜깍쉼要뺞눟獰↓슧節곈옖檍놅슈矮곻쉠節 1110100011111000101100011110111110111101101100001110100110101001100101011110011010000111101101111110011110111110101000011110100110011010101100011110111110111101101100001110100110011110100111001110010111100101100001101110111110111101101101001110100011100001100000011110111110111101101010101110111110111101 e8f8b1efbdb0e9a995e687b7e7bea1e99ab1efbdb0e99e9ce5e586efbdb4e8e181efbdaaefbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)