To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????^????v????^????vB 001111110011111100111111001111110101111000111111001111110011111100111111011101100011111100111111001111110011111101011110001111110011111100111111001111110111011001000010 3f3f3f3f5e3f3f3f3f763f3f3f3f5e3f3f3f3f7642
SJIS-WIN 鬯ア闊。^鬯ア髯閤v鬯ア闊。^鬯ア髯閤vB 11101001101011001011000111101000100010001010000101011110111010011010110010110001111010011001100110001101011111010111011011101001101011001011000111101000100010001010000101011110111010011010110010110001111010011001100110001101011111010111011001000010 e9acb1e888a15ee9acb1e9998d7d76e9acb1e888a15ee9acb1e9998d7d7642
EUC-JP 鬯ア闊。^鬯ア髯閤v鬯ア闊。^鬯ア髯閤vB 11110010101011101000111010110001111011111110100010001110101000010101111011110010101011101000111010110001111100011111100110111001110111100111011011110010101011101000111010110001111011111110100010001110101000010101111011110010101011101000111010110001111100011111100110111001110111100111011001000010 f2ae8eb1efe88ea15ef2ae8eb1f1f9b9de76f2ae8eb1efe88ea15ef2ae8eb1f1f9b9de7642
UTF-8 鬯ア闊。^鬯ア髯閤v鬯ア闊。^鬯ア髯閤vB 1110100110101100101011111110111110111101101100011110100110010111100010101110111110111101101000010101111011101001101011001010111111101111101111011011000111101001101010111010111111101001100101101010010001110110111010011010110010101111111011111011110110110001111010011001011110001010111011111011110110100001010111101110100110101100101011111110111110111101101100011110100110101011101011111110100110010110101001000111011001000010 e9acafefbdb1e9978aefbda15ee9acafefbdb1e9abafe996a476e9acafefbdb1e9978aefbda15ee9acafefbdb1e9abafe996a47642
UHC ??闊?^???閤v??闊?^???閤vB 00111111001111111111110011000100001111110101111000111111001111110011111111111001111011100111011000111111001111111111110011000100001111110101111000111111001111110011111111111001111011100111011001000010 3f3ffcc43f5e3f3f3ff9ee763f3ffcc43f5e3f3f3ff9ee7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)