To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 楷?裨W^楷?裨\}v楷?裨W^楷?裨\}vB 10011110101100100011111111100101111010010101011101011110100111101011001000111111111001011110100101011100011111010111011010011110101100100011111111100101111010010101011101011110100111101011001000111111111001011110100101011100011111010111011001000010 9eb23fe5e9575e9eb23fe5e95c7d769eb23fe5e9575e9eb23fe5e95c7d7642
EUC-JP 楷?裨W^楷?裨\}v楷?裨W^楷?裨\}vB 11011100101101000011111111101010111010110101011101011110110111001011010000111111111010101110101101011100011111010111011011011100101101000011111111101010111010110101011101011110110111001011010000111111111010101110101101011100011111010111011001000010 dcb43feaeb575edcb43feaeb5c7d76dcb43feaeb575edcb43feaeb5c7d7642
UTF-8 楷곈裨W^楷곈裨\}v楷곈裨W^楷곈裨\}vB 1110011010100101101101111110101010110011100010001110100010100011101010000101011101011110111001101010010110110111111010101011001110001000111010001010001110101000010111000111110101110110111001101010010110110111111010101011001110001000111010001010001110101000010101110101111011100110101001011011011111101010101100111000100011101000101000111010100001011100011111010111011001000010 e6a5b7eab388e8a3a8575ee6a5b7eab388e8a3a85c7d76e6a5b7eab388e8a3a8575ee6a5b7eab388e8a3a85c7d7642
UHC 楷곈裨W^楷곈裨\}v楷곈裨W^楷곈裨\}vB 1111101010101100101100001110100111011110101001010101011101011110111110101010110010110000111010011101111010100101010111000111110101110110111110101010110010110000111010011101111010100101010101110101111011111010101011001011000011101001110111101010010101011100011111010111011001000010 faacb0e9dea5575efaacb0e9dea55c7d76faacb0e9dea5575efaacb0e9dea55c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)