What bit string does a character (or string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??泣??臾??艾??鸚??泣??臾??艾 111010100101111100111111001111111000101110000011001111110011111111100100011010110011111100111111111001001000100000111111001111111110101001011111001111110011111110001011100000110011111100111111111001000110101100111111001111111110010010001000 ea5f3f3f8b833f3fe46b3f3fe4883f3fea5f3f3f8b833f3fe46b3f3fe488
EUC-JP 鸚??泣??臾??艾?ˇ鸚??泣??臾??艾 1111001111000000001111110011111110110101111000110011111100111111111001111100110000111111001111111110011111101000001111111000111110100010101100001111001111000000001111110011111110110101111000110011111100111111111001111100110000111111001111111110011111101000 f3c03f3fb5e33f3fe7cc3f3fe7e83f8fa2b0f3c03f3fb5e33f3fe7cc3f3fe7e8
UTF-8 鸚뽰닂泣앯독臾덈굩艾쎈ˇ鸚뽰닂泣앯독臾덈굩艾 1110100110111000100110101110101110111101101100001110101110001011100000101110011010110011101000111110110010010101101011111110101110001111100001011110100010000111101111101110101110001101100010001110101010110101101010011110100010001001101111101110110010001110100010001100101110000111111010011011100010011010111010111011110110110000111010111000101110000010111001101011001110100011111011001001010110101111111010111000111110000101111010001000011110111110111010111000110110001000111010101011010110101001111010001000100110111110 e9b89aebbdb0eb8b82e6b3a3ec95afeb8f85e887beeb8d88eab5a9e889beec8e88cb87e9b89aebbdb0eb8b82e6b3a3ec95afeb8f85e887beeb8d88eab5a9e889be
UHC 鸚뽰닂泣앯독臾덈굩艾쎈ˇ鸚뽰닂泣앯독臾덈굩艾 1110010110100100100101101110110010001000100010111110101111101000100111011110011110110101101101101110101110101100100010001110101110000010100011111110010011110101101111011110101110100010101001111110010110100100100101101110110010001000100010111110101111101000100111011110011110110101101101101110101110101100100010001110101110000010100011111110010011110101 e5a496ec888bebe89de7b5b6ebac88eb828fe4f5bdeba2a7e5a496ec888bebe89de7b5b6ebac88eb828fe4f5

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
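
A conversion like the one in the table can be approximated offline. The following is a minimal sketch, not this page's actual implementation: the helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_digits) in encode_table("あ").items():
        print(label, bits, hex_digits)
```

Using `errors="replace"` keeps the output one row per charset even when the input cannot be represented; with `errors="strict"` Python would raise `UnicodeEncodeError` instead.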