To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 遭??釵い???? 100100011001100000111111001111111110011111011110100000101010001000111111001111110011111100111111 91983f3fe7de82a23f3f3f3f
EUC-JP 遭??釵い???? 110000011111100000111111001111111110111011100000101001001010010000111111001111110011111100111111 c1f83f3feee0a4a43f3f3f3f
UTF-8 遭솥양釵い씻렼솽셥 111010011000000110101101111011001000011010100101111011001001011010010001111010011000011110110101111000111000000110000100111011001001010010111011111010111010000010111100111011001000011010111101111011001000010110100101 e981adec86a5ec9691e987b5e38184ec94bbeba0bcec86bdec85a5
UHC 遭솥양釵い씻렼솽셥 111100001110010010111100110111001011111011100111111100111111101110101010101001001011111011000100100011101100010010111100111000011011110011001010 f0e4bcdcbee7f3fbaaa4bec48ec4bce1bcca

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)