To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 燿??搖??予??竊??節↓?殃??外??^ 111000001010000000111111001111111001110110001010001111110011111110010111010111000011111100111111111000101000011000111111001111111001000011011111100000011010101100111111100111110110100100111111001111111000101001001111001111110011111101011110 e0a03f3f9d8a3f3f975c3f3fe2863f3f90df81ab3f9f693f3f8a4f3f3f5e
EUC-JP 燿??搖??予??竊??節↓?殃??外??^ 111000001010001000111111001111111101100111101010001111110011111111001101101111010011111100111111111000111110011000111111001111111100000011100001101000101010110100111111110111011100101000111111001111111011001110110000001111110011111101011110 e0a23f3fd9ea3f3fcdbd3f3fe3e63f3fc0e1a2ad3fddca3f3fb3b03f3f5e
UTF-8 燿먲숯搖㎩똻予숅꽇竊롩옄節↓뼤殃곫슇外숋펿^ 11100111100001111011111111101011101010001011001011101100100010001010111111100110100100001001011011100011100011101010100111101011100110001011101111100100101110101000100011101100100010001000010111101010101111011000011111100111101010111000101011101011101000011010100111101100100110001000010011100111101011111000000011100010100001101001001111101011101111001010010011100110101011101000001111101010101100111010101111101100100010101000011111100101101001001001011011101100100010001000101111101101100011101011111101011110 e787bfeba8b2ec88afe69096e38ea9eb98bbe4ba88ec8885eabd87e7ab8aeba1a9ec9884e7af80e28693ebbca4e6ae83eab3abec8a87e5a496ec888bed8ebf5e
UHC 燿먲숯搖㎩똻予숅꽇竊롩옄節↓뼤殃곫슇外숋펿^ 11101000111111001001000011101111101111011010000111101000111101001010011111100101100011001000000111100101111110001001100111101001100001001001100111101111101111001000111011101001100111101001000011101111101111011010000111101001100101101010011111100100111010101000000111100110100110101001100111101000111000101001100111101111101111001000111001011110 e8fc90efbda1e8f4a7e58c81e5f899e98499efbc8ee99e90efbda1e996a7e4ea81e69a99e8e299efbc8e5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)