To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 烏k?冶??諛??諛?。???鴉??業??^ 100010010100011110000010100010110011111110010110111010000011111100111111111001101000011100111111001111111110011010000111001111111000000101000010001111110011111100111111111010011110101100111111001111111000101111000110001111110011111101011110 8947828b3f96e83f3fe6873f3fe6873f81423f3f3fe9eb3f3f8bc63f3f5e
EUC-JP 烏k?冶??諛??諛?。???鴉??業??^ 101100011010100010100011111010110011111111001100111010100011111100111111111010111110011100111111001111111110101111100111001111111010000110100011001111110011111100111111111100101110110100111111001111111011011011001000001111110011111101011110 b1a8a3eb3fccea3f3febe73f3febe73fa1a33f3f3ff2ed3f3fb6c83f3f5e
UTF-8 烏k젧冶ⓧ펯諛앮삺諛쏂。溜잌㎤鴉곷컼業깅컱^ 11100111100000111000111111101111101111011000101111101100101000001010011111100101100001101011011011100010100100111010011111101101100011101010111111101000101010111001101111101100100101011010111011101100100000101011101011101000101010111001101111101100100011111000001011100011100000001000001011101111101001111000101111101100100111101000110011100011100011101010010011101001101101001000100111101010101100111011011111101100101110111011110011100110101001011010110111101010101110011000010111101100101110111011000101011110 e7838fefbd8beca0a7e586b6e293a7ed8eafe8ab9bec95aeec82bae8ab9bec8f82e38082efa78bec9e8ce38ea4e9b489eab3b7ecbbbce6a5adeab985ecbbb15e
UHC 烏k젧冶ⓧ펯諛앮삺諛쏂。溜잌㎤鴉곷컼業깅컱^ 11101000101000011010001111101011101000001001111111100101101001111010100011100100101111001000000111101011101100001001110111100110100110001011000111101011101100001001101111101000101000011010001111101010111111101001111111100101101001111010100011100100101111001000000111101011101100001001110111100101111101101011000111101011101100001001011101011110 e8a1a3eba09fe5a7a8e4bc81ebb09de698b1ebb09be8a1a3eafe9fe5a7a8e4bc81ebb09de5f6b1ebb0975e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)