To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 如??狎?????絶??與??節ц?奧??^ 1001010001000000001111110011111111100000101111100011111100111111001111110011111100111111100100001110001000111111001111111110010001101111001111110011111110010000110111111000010010001000001111111001101011111010001111110011111101011110 94403f3fe0be3f3f3f3f3f90e23f3fe46f3f3f90df84883f9afa3f3f5e
EUC-JP 如??狎??邕??絶??與??節ц?奧??^ 11000111101000010011111100111111111000001100000000111111001111111000111111100001111011010011111100111111110000001110010000111111001111111110011111010000001111110011111111000000111000011010011111101000001111111101010011111100001111110011111101011110 c7a13f3fe0c03f3f8fe1ed3f3fc0e43f3fe7d03f3fc0e1a7e83fd4fc3f3f5e
UTF-8 如닸쯃狎먲쉬邕멱븡絶쎿윺與잌넀節ц뻗奧딉풖^ 111001011010011010000010111010111000101110111000111011001010111110000011111001111000101110001110111010111010100010110010111011001000100110101100111010011000001010010101111010111010100110110001111010111011100010100001111001111011010110110110111011001000111010111111111011001001110010111010111010001000100010000111111011001001111010001100111010111000010010000000111001111010111110000000110100011000011011101011101110111001011111100101101001011010011111101011100101001000100111101101100100101001011001011110 e5a682eb8bb8ecaf83e78b8eeba8b2ec89ace98295eba9b1ebb8a1e7b5b6ec8ebfec9cbae88887ec9e8ceb8480e7af80d186ebbb97e5a5a7eb9489ed92965e
UHC 如닸쯃狎먲쉬邕멱븡絶쎿윺與잌넀節ц뻗奧딉풖^ 11100101111111011011010011100110101010001001111111100100111001001001000011101111101111011010110011101000101110111011100011101000100101011000101011101111101111101001101111100110100111111011010011100110101010001001111111100101100001101001000011101111101111011010110011101000101110111011100011100111111100111000101011101111101111101001100101011110 e5fdb4e6a89fe4e490efbdace8bbb8e8958aefbe9be69fb4e6a89fe58690efbdace8bbb8e7f38aefbe995e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)