To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?訥??芝?失?鯤?訥??芝?失?鯤B 001111111110011001100011001111110011111110001110110001010011111110001110101110000011111111101001110010000011111111100110011000110011111100111111100011101100010100111111100011101011100000111111111010011100100001000010 3fe6633f3f8ec53f8eb83fe9c83fe6633f3f8ec53f8eb83fe9c842
EUC-JP 侁訥??芝?失?鯤侁訥??芝?失?鯤B 10001111101100001111110011101011110001000011111100111111101111001100011100111111101111001011101000111111111100101100101010001111101100001111110011101011110001000011111100111111101111001100011100111111101111001011101000111111111100101100101001000010 8fb0fcebc43f3fbcc73fbcba3ff2ca8fb0fcebc43f3fbcc73fbcba3ff2ca42
UTF-8 侁訥렍렖芝렫失겻鯤侁訥렍렖芝렫失겻鯤B 11100100101111101000000111101000101010001010010111101011101000001000110111101011101000001001011011101000100010101001110111101011101000001010101111100101101001001011000111101010101100101011101111101001101011111010010011100100101111101000000111101000101010001010010111101011101000001000110111101011101000001001011011101000100010101001110111101011101000001010101111100101101001001011000111101010101100101011101111101001101011111010010001000010 e4be81e8a8a5eba08deba096e88a9deba0abe5a4b1eab2bbe9afa4e4be81e8a8a5eba08deba096e88a9deba0abe5a4b1eab2bbe9afa442
UHC 侁訥렍렖芝렫失겻鯤侁訥렍렖芝렫失겻鯤B 11100011111000001101001011101101100011101010001110001110101010111111001010111001100011101011100111100011111101111011000011100100110011011110011011100011111000001101001011101101100011101010001110001110101010111111001010111001100011101011100111100011111101111011000011100100110011011110011001000010 e3e0d2ed8ea38eabf2b98eb9e3f7b0e4cde6e3e0d2ed8ea38eabf2b98eb9e3f7b0e4cde642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)