To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???乙??濡??鴦??以??音????? 0011111100111111001111111000100110110011001111110011111110010100010001110011111100111111111010011111000100111111001111111000100011001000001111110011111110001001101110010011111100111111001111110011111100111111 3f3f3f89b33f3f94473f3fe9f13f3f88c83f3f89b93f3f3f3f3f
EUC-JP 縯??乙??濡??鴦??以??音????? 10001111110101001100101100111111001111111011001010110101001111110011111111000111101010000011111100111111111100101111001100111111001111111011000011001010001111110011111110110010101110110011111100111111001111110011111100111111 8fd4cb3f3fb2b53f3fc7a83f3ff2f33f3fb0ca3f3fb2bb3f3f3f3f3f
UTF-8 縯롪낳乙꿰독濡뀀닔鴦볤맏以됭쵟音쎌뎾輦깊룞 111001111011100010101111111010111010000110101010111010111000001010110011111001001011100110011001111010101011111110110000111010111000111110000101111001101011111110100001111010111000000010000000111010111000101110010100111010011011010010100110111010111011001110100100111010111010011110001111111001001011101110100101111010111001000010101101111011001011010110011111111010011001111110110011111011001000111010001100111010111000111010111110111011111010011010011000111010101011100110001010111010111010001110011110 e7b8afeba1aaeb82b3e4b999eabfb0eb8f85e6bfa1eb8080eb8b94e9b4a6ebb3a4eba78fe4bba5eb90adecb59fe99fb3ec8e8ceb8ebeefa698eab98aeba39e
UHC 縯롪낳乙꿰독濡뀀닔鴦볤맏以됭쵟音쎌뎾輦깊룞 111001101110000010001110111010101011001110111010111010111110000010110010111001111011010110110110111010111010000110110010111010111000100010011000111001001110110010010011111010101011100010111010111011001010010010001001111010001010110010100000111010111110010110111101111011001000100110010001111001101110010010110001111011011000111110011001 e6e08eeab3baebe0b2e7b5b6eba1b2eb8898e4ec93eab8baeca489e8aca0ebe5bdec8991e6e4b1ed8f99
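
The conversion in the table above can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: the function names `encode_row` and `convert` are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed to be the closest equivalents of the charsets listed. Characters that a charset cannot represent are replaced with `?`, on the assumption that this is what the tool does.

```python
# Map the table's charset labels to Python codec names.
# ASSUMPTION: these codecs are the closest Python equivalents;
# the tool itself may use slightly different mapping tables.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset of the table (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("乙").items():
        print(label, bits, hexstr)
```

For the single character 乙, this sketch yields `3f` for ISO-8859-1, `89b3` for SJIS-WIN, `b2b5` for EUC-JP, and `e4b999` for UTF-8, which agrees with the byte sequences for that character in the rows above. A `3f` byte in a row is the code for `?`, which would appear wherever the character has no representation in that charset.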

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).