To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鈺??厓???э?節ц?窈??腋??玉 111110111100010000111111001111111111101010001101001111110011111100111111100001001000111100111111100100001101111110000100100010000011111111100010011101110011111100111111111000111111110000111111001111111000101111001010 fbc43f3ffa8d3f3f3f848f3f90df84883fe2773f3fe3fc3f3f8bca
EUC-JP 鈺??厓???э?節ц?窈??腋??玉 1000111111100011110101010011111100111111100011111011010011000111001111110011111100111111101001111110111100111111110000001110000110100111111010000011111111100011110110000011111100111111111001101111111000111111001111111011011011001100 8fe3d53f3f8fb4c73f3f3fa7ef3fc0e1a7e83fe3d83f3fe6fe3f3fb6cc
UTF-8 鈺싧ㄷ厓됭춾寧э슛節ц춾窈뚳슈腋잍릹玉 11101001100010001011101011101100100010111010011111100011100001001011011111100101100011101001001111101011100100001010110111101100101101101011111011101111101001101010101011010001100011011110110010001010100110111110011110101111100000001101000110000110111011001011011010111110111001111010101010001000111010111001101010110011111011001000101010001000111010001000010110001011111011001001111010001101111010111010011010111001111001111000111010001001 e988baec8ba7e384b7e58e93eb90adecb6beefa6aad18dec8a9be7af80d186ecb6bee7aa88eb9ab3ec8a88e8858bec9e8deba6b9e78e89
UHC 鈺싧ㄷ厓됭춾寧э슛節ц춾窈뚳슈腋잍릹玉 1110100010101101100110101110010110100100101001111110010011101101100010011110100010101101100110101110011110101100101011001110111110111101101110001110111110111101101011001110100010101101100110101110100110100001100011001110111110111101101101001110010011111101100111111110011010010000100101111110100010101100 e8ad9ae5a4a7e4ed89e8ad9ae7acacefbdb8efbdace8ad9ae9a18cefbdb4e4fd9fe69097e8ac

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
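
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation, which is not shown on this page. The `CODECS` mapping and the `encode_row` helper are hypothetical names introduced here, and the Python codec names are assumed to be close equivalents of the charset labels in the table (for example `cp932` for SJIS-WIN and `cp949` for UHC). Characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the runs of `3f` visible in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels used in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, charset):
    """Return (binary string, hex string) for text encoded in the given charset.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(CODECS[charset], errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for charset in CODECS:
        binary, hexadecimal = encode_row(sample, charset)
        print(charset, sample, binary, hexadecimal)
```

Each printed line has the same columns as the table above: charset, input text, binary bit string, and hexadecimal bit string. Results for SJIS-WIN may differ slightly from the tool's for vendor-specific characters, since the exact codec the tool uses is an assumption here.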