To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 馭??泣??袁れ?鸚??肄∽????冶 111010010110011000111111001111111000101110000011001111110011111111100101110011011000001011101010001111111110101001011111001111110011111111100011111001011000000111100100001111110011111100111111001111111001011011101000 e9663f3f8b833f3fe5cd82ea3fea5f3f3fe3e581e43f3f3f3f96e8
EUC-JP 馭??泣??袁れ?鸚??肄∽????冶 111100011100011100111111001111111011010111100011001111110011111111101010110011111010010011101100001111111111001111000000001111110011111111100110111001111010001011100110001111110011111100111111001111111100110011101010 f1c73f3fb5e33f3feacfa4ec3ff3c03f3fe6e7a2e63f3f3f3fccea
UTF-8 馭곥룊泣쒙㏊袁れ궒鸚룐뫗肄∽쭪戮곗췅冶 111010011010011010101101111010101011001110100101111010111010001110001010111001101011001110100011111011001001001010011001111000111000111110001010111010001010001010000001111000111000001010001100111010101011011010010010111010011011100010011010111010111010001110010000111010111010101110010111111010001000001010000100111000101000100010111101111011001010110110101010111011111010011110010010111010101011001110010111111011001011011110000101111001011000011010110110 e9a6adeab3a5eba38ae6b3a3ec9299e38f8ae8a281e3828ceab692e9b89aeba390ebab97e88284e288bdecadaaefa792eab397ecb785e586b6
UHC 馭곥룊泣쒙㏊袁れ궒鸚룐뫗肄∽쭪戮곗췅冶 1110010111011111100000011110001110001111100010011110101111101000100111001110111110100111101101011110101010111110101010101110110010000010101001111110010110100100101101111110001010010001101110011110110010111101101000011110111110100111100111101110101110111101101100001110110010101101101000001110010110100111 e5df81e38f89ebe89cefa7b5eabeaaec82a7e5a4b7e291b9ecbda1efa79eebbdb0ecada0e5a7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)