To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ??鴦拔?????n}??鴦拔?????n{^ 001111110011111111101001111100011001110101010101001111110011111100111111001111110011111101101110011111010011111100111111111010011111000110011101010101010011111100111111001111110011111100111111011011100111101101011110 3f3fe9f19d553f3f3f3f3f6e7d3f3fe9f19d553f3f3f3f3f6e7b5e
EUC-JP ??鴦拔?????n}??鴦拔?????n{^ 001111110011111111110010111100111101100110110110001111110011111100111111001111110011111101101110011111010011111100111111111100101111001111011001101101100011111100111111001111110011111100111111011011100111101101011110 3f3ff2f3d9b63f3f3f3f3f6e7d3f3ff2f3d9b63f3f3f3f3f6e7b5e
UTF-8 앉뤙鴦拔렗성씬렗샵n}앉뤙鴦拔렗성씬렗샵n{^ 1110110010010101100010011110101110100100100110011110100110110100101001101110011010001011100101001110101110100000100101111110110010000100101100011110110010010100101011001110101110100000100101111110110010000011101101010110111001111101111011001001010110001001111010111010010010011001111010011011010010100110111001101000101110010100111010111010000010010111111011001000010010110001111011001001010010101100111010111010000010010111111011001000001110110101011011100111101101011110 ec9589eba499e9b4a6e68b94eba097ec84b1ec94aceba097ec83b56e7dec9589eba499e9b4a6e68b94eba097ec84b1ec94aceba097ec83b56e7b5e
UHC 앉뤙鴦拔렗성씬렗샵n}앉뤙鴦拔렗성씬렗샵n{^ 1011111011001001100011111100100011100100111011001101101011111011100011101010110010111100101110101011111011000000100011101010110010111100101001010110111001111101101111101100100110001111110010001110010011101100110110101111101110001110101011001011110010111010101111101100000010001110101011001011110010100101011011100111101101011110 bec98fc8e4ecdafb8eacbcbabec08eacbca56e7dbec98fc8e4ecdafb8eacbcbabec08eacbca56e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)