To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻れ??f?宥?????日??音??暗??? 1110010011101000100000101110101000111111001111111000001010000110001111111001011101000111001111110011111100111111001111110011111110010011111110100011111100111111100010011011100100111111001111111000100011000011001111110011111100111111 e4e882ea3f3f82863f97473f3f3f3f3f93fa3f3f89b93f3f88c33f3f3f
EUC-JP 蒻れ??f?宥?????日??音??暗??? 1110100011101010101001001110110000111111001111111010001111100110001111111100110110101000001111110011111100111111001111110011111111000110111111000011111100111111101100101011101100111111001111111011000011000101001111110011111100111111 e8eaa4ec3f3fa3e63fcda83f3f3f3f3fc6fc3f3fb2bb3f3fb0c53f3f3f
UTF-8 蒻れ슦杻f룚宥살첎廬믩벝日믣쮦音산턄暗싲틹梨 111010001001001010111011111000111000001010001100111011001000101010100110111011111010011110001000111011111011110110000110111010111010001110011010111001011010111010100101111011001000001010110100111011001011001010001110111011111010011010000010111010111010111110101001111010111011001010011101111001101001011110100101111010111010111110100011111011001010111010100110111010011001111110110011111011001000001010110000111011011000010010000100111001101001101010010111111011001000101110110010111011011000101110111001111011111010011110100010 e892bbe3828cec8aa6efa788efbd86eba39ae5aea5ec82b4ecb28eefa682ebafa9ebb29de697a5ebafa3ecaea6e99fb3ec82b0ed8484e69a97ec8bb2ed8bb9efa7a2
UHC 蒻れ슦杻f룚宥살첎廬믩벝日믣쮦音산턄暗싲틹梨 1110010110110110101010101110110010011010101100001110101011110100101000111110011010001111100101101110101011101001101110111110110010101010100110111110010111111110100100101110101110010011101110001110110011101101100100101110010110101000100000111110101111100101101110111110101010110101101000001110010011011110100110101110101110111010100111111110110010110001 e5b6aaec9ab0eaf4a3e68f96eae9bbecaa9be5fe92eb93b8eced92e5a883ebe5bbeab5a0e4de9aebba9fecb1
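
The conversion shown in the table above can be approximated with a short Python sketch. This is not the page's own implementation. The helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the `3f` bytes in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (byte 0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset; returns {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("日").items():
        print(label, binary, hexadecimal)
```

For the single character 日, this sketch yields `e697a5` in UTF-8, `93fa` in CP932, and `c6fc` in EUC-JP, matching the byte sequences for that character in the rows above. ISO-8859-1 cannot represent it and falls back to `3f`.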

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).