To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄μ?揖??揄??榮??轅??筌??爾 100101101110111110000011110010100011111110010111010010110011111100111111100111011000100100111111001111111001111011000100001111110011111111100111011101100011111100111111111000101010001100111111001111111000111010100010 96ef83ca3f974b3f3f9d893f3f9ec43f3fe7763f3fe2a33f3f8ea2
EUC-JP 厄μ?揖??揄?Œ榮??轅??筌??爾 1100110011110001101001101100110000111111110011011010110000111111001111111101100111101001001111111000111110101001101011011101110011000110001111110011111111101101110101110011111100111111111001001010010100111111001111111011110010100100 ccf1a6cc3fcdac3f3fd9e93f8fa9addcc63f3fedd73f3fe4a53f3fbca4
UTF-8 厄μ떝揖며몭揄앹Œ榮싷쭑轅⑸눤筌뤾쑬爾 11100101100011101000010011001110101111001110101110010110100111011110011010001111100101101110101110101001101100001110101110101010101011011110011010001111100001001110110010010101101110011100010110010010111001101010011010101110111011001000101110110111111011001010110110010001111010001011110110000101111000101001000110111000111010111000100010100100111001111010110110001100111010111010010010111110111011001001000110101100111001111000100010111110 e58e84cebceb969de68f96eba9b0ebaaade68f84ec95b9c592e6a6aeec8bb7ecad91e8bd85e291b8eb88a4e7ad8ceba4beec91ace788be
UHC 厄μ떝揖며몭揄앹Œ榮싷쭑轅⑸눤筌뤾쑬爾 1110010011111000101001011110110010001011101100111110101111100111101110001110011110010001100101111110101011110001100111011110110010101000101010111110011110110100100110101110111110100111100010011110101010111111101010011110101110000111101110111110111110100111100011111110101010111110101010001110110010110011 e4f8a5ec8bb3ebe7b8e79197eaf19deca8abe7b49aefa789eabfa9eb87bbefa78feabea8ecb3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)