To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??泣f?議??榮??誼??諛??蘂 111000011001111100111111001111111000101110000011100000101000011000111111100010110110001100111111001111111001111011000100001111110011111110001011011000100011111100111111111001101000011100111111001111111110010101000001 e19f3f3f8b8382863f8b633f3f9ec43f3f8b623f3fe6873f3fe541
EUC-JP 癲??泣f?議??榮??誼??諛??蘂 111000101010000100111111001111111011010111100011101000111110011000111111101101011100010000111111001111111101110011000110001111110011111110110101110000110011111100111111111010111110011100111111001111111110100110100010 e2a13f3fb5e3a3e63fb5c43f3fdcc63f3fb5c33f3febe73f3fe9a2
UTF-8 癲숈옓泣f쾮議얠를榮붿빖誼㎩넇諛댄뜑蘂 111001111001100110110010111011001000100010001000111011001001100010010011111001101011001110100011111011111011110110000110111011001011111010101110111010001010110110110000111011001001011010100000111010111010010110111100111001101010011010101110111010111011011010111111111010111011100110010110111010001010101010111100111000111000111010101001111010111000010010000111111010001010101110011011111010111000110010000100111010111001110010010001111010001001100010000010 e799b2ec8888ec9893e6b3a3efbd86ecbeaee8adb0ec96a0eba5bce6a6aeebb6bfebb996e8aabce38ea9eb8487e8ab9beb8c84eb9c91e89882
UHC 癲숈옓泣f쾮議얠를榮붿빖誼㎩넇諛댄뜑蘂 1110111110100110100110011110110010011110100110011110101111101000101000111110011010110010100001011110110010100001101111101110110010111000101001101110011110110100100101001110110010010101101110001110101111111110101001111110010110000110100101111110101110110000101101001110110110001101100101001110011111011110 efa699ec9e99ebe8a3e6b285eca1beecb8a6e7b494ec95b8ebfea7e58697ebb0b4ed8d94e7de

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)