What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ????????W?? | 0011111100111111001111110011111100111111001111110011111100111111010101110011111100111111 | 3f3f3f3f3f3f3f3f573f3f |
| SJIS-WIN | 螳溷脅螻。遽檎明W螻。 | 11100101101011101001111111100101100010111011101011100101101100011010000111100111101011111000110011100111100101101011111001010111111001011011000110100001 | e5ae9fe58bbae5b1a1e7af8ce796be57e5b1a1 |
| EUC-JP | 螳溷脅螻。遽檎明W螻。 | 111010101011000011011110111001111011011010111100111010101011001110001110101000011110111010110001101110001110100111001100110000000101011111101010101100111000111010100001 | eab0dee7b6bceab38ea1eeb1b8e9ccc057eab38ea1 |
| UTF-8 | 螳溷脅螻。遽檎明W螻。 | 11101000100111101011001111100110101110101011011111101000100001001000010111101000100111101011101111101111101111011010000111101001100000011011110111100110101010101000111011100110100110001000111001010111111010001001111010111011111011111011110110100001 | e89eb3e6bab7e88485e89ebbefbda1e981bde6aa8ee6988e57e89ebbefbda1 |
| UHC | 螳?脅??遽檎明W?? | 11010011110110010011111111111010111101100011111100111111110010111110100011010000110101011101100110100101010101110011111100111111 | d3d93ffaf63f3fcbe8d0d5d9a5573f3f |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
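
A conversion like the one in the table can be approximated offline. The following is only a minimal sketch in Python, not the code behind this page. It assumes that Python's `cp932` and `cp949` codecs correspond to SJIS-Win and UHC, and that characters a charset cannot represent are replaced with "?" (byte 3f), as the ISO-8859-1 and UHC rows suggest. The names `CODECS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-Win and cp949 for UHC.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?".
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("W").items():
        print(name, bits, hex_string)
```

Run as a script, this prints one line per charset for the input "W". Passing a different string to `encode_table` gives the corresponding rows for that input.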