To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴨??泣??伎泣??邑 10001010100110110011111100111111100010111000001100111111001111111000101011101010100010111000001100111111001111111001011101010111 8a9b3f3f8b833f3f8aea8b833f3f9757
EUC-JP 鴨??泣??伎泣??邑 10110011111110110011111100111111101101011110001100111111001111111011010011101100101101011110001100111111001111111100110110111000 b3fb3f3fb5e33f3fb4ecb5e33f3fcdb8
UTF-8 鴨뱀쉻泣쏃뇦伎泣쏁독邑 111010011011010010101000111010111011000110000000111011001000100110111011111001101011001110100011111011001000111110000011111010111000011110100110111001001011110010001110111001101011001110100011111011001000111110000001111010111000111110000101111010011000001010010001 e9b4a8ebb180ec89bbe6b3a3ec8f83eb87a6e4bc8ee6b3a3ec8f81eb8f85e98291
UHC 鴨뱀쉻泣쏃뇦伎泣쏁독邑 11100100111001011011100111101100100110101001000111101011111010001001101111101001100001111000111011010000111010111110101111101000100110111110011110110101101101101110101111101001 e4e5b9ec9a91ebe89be9878ed0ebebe89be7b5b6ebe9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)