To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 畏??慂?????節??節??^ 1000100011011000001111110011111110011100110010000011111100111111001111110011111100111111100100001101111100111111001111111001000011011111001111110011111101011110 88d83f3f9cc83f3f3f3f3f90df3f3f90df3f3f5e
EUC-JP 畏??慂?????節??節??^ 1011000011011010001111110011111111011000110010100011111100111111001111110011111100111111110000001110000100111111001111111100000011100001001111110011111101011110 b0da3f3fd8ca3f3f3f3f3fc0e13f3fc0e13f3f5e
UTF-8 畏놅슐慂딉쉿轢녽늽節띌퐡節꿩펵^ 11100111100101011000111111101011100001101000010111101100100010101001000011100110100001011000001011101011100101001000100111101100100010011011111111101111101001101000110111101011100001011011110111101011100010101011110111100111101011111000000011101011100111011000110011101101100100001010000111100111101011111000000011101010101111111010100111101101100011101011010101011110 e7958feb8685ec8a90e68582eb9489ec89bfefa68deb85bdeb8abde7af80eb9d8ced90a1e7af80eabfa9ed8eb55e
UHC 畏놅슐慂딉쉿轢녽늽節띌퐡節꿩펵^ 11101000111001101000011011101111101111011011011011101001101111011000101011101111101111011011001011100110101111001000011011101001100010001000011011101111101111011011011011101001101111011000101011101111101111011011001011100110101111001000011001011110 e8e686efbdb6e9bd8aefbdb2e6bc86e98886efbdb6e9bd8aefbdb2e6bc865e

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
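
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's own implementation. The mapping from the table's charset names to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, as are the helper names `to_bit_strings` and `convert`. Characters that a charset cannot represent are replaced with `?` (0x3f), which is assumed to mirror the `3f` bytes seen in the table.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per encoded byte
    return binary, data.hex()


def convert(text):
    """Return {charset name: (binary, hexadecimal)} for every charset in CHARSETS."""
    return {name: to_bit_strings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Hypothetical sample input: one kanji followed by an ASCII caret.
    for name, (binary, hexadecimal) in convert("畏^").items():
        print(name, binary, hexadecimal)
```

Each encoded byte is written as eight binary digits, so the binary string is always four times as long as the hexadecimal one. An ASCII character such as `^` should come out as `5e` in every charset listed here, which agrees with the final byte of each row in the table.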