To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 菴ワ讐菴刻終菴刻従蔗 1110010010111101100000111000111110001111010100011110010010111101100011011000111110001111010010011110010010111101100011011000111110001111010111011110010011110010 e4bd838f8f51e4bd8d8f8f49e4bd8d8f8f5de4f2
EUC-JP 菴ワ讐菴刻終菴刻従蔗 1110100010111111101001011110111110111101101100101110100010111111101110011110111110111101101010101110100010111111101110011110111110111101101111101110100011110100 e8bfa5efbdb2e8bfb9efbdaae8bfb9efbdbee8f4
UTF-8 菴ワ讐菴刻終菴刻従蔗 111010001000111110110100111000111000001110101111111010001010111010010000111010001000111110110100111001011000100010111011111001111011010110000010111010001000111110110100111001011000100010111011111001011011111010010011111010001001010010010111 e88fb4e383afe8ae90e88fb4e588bbe7b582e88fb4e588bbe5be93e89497
UHC 菴ワ讐菴刻終菴刻?蔗 11100100111000001010101111101111111000101100001011100100111000001100101010111110111100001111101111100100111000001100101010111110001111111110110110111101 e4e0abefe2c2e4e0cabef0fbe4e0cabe3fedbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)