To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?溢g????????癲??泣ょ? 1110000110011111100000111000101100111111100010001110110010000010100001110011111100111111001111110011111100111111001111110011111100111111111000011001111100111111001111111000101110000011100000101110010100111111 e19f838b3f88ec82873f3f3f3f3f3f3f3fe19f3f3f8b8382e53f
EUC-JP 癲ル?溢g????孼???癲??泣ょ? 11100010101000011010010111101011001111111011000011101110101000111110011100111111001111110011111100111111100011111011101011000011001111110011111100111111111000101010000100111111001111111011010111100011101001001110011100111111 e2a1a5eb3fb0eea3e73f3f3f3f8fbac33f3f3fe2a13f3fb5e3a4e73f
UTF-8 癲ル슪溢g뎁硫곗땡孼꾊딇꺍癲쒖슜泣ょ킈 111001111001100110110010111000111000001110101011111011001000101010101010111001101011101010100010111011111011110110000111111010111000111010000001111011111010011110001110111010101011001110010111111010111001010110100001111001011010110110111100111010101011111010001010111010111001010010000111111010101011101010001101111001111001100110110010111011001001001010010110111011001000101010011100111001101011001110100011111000111000001010000111111011011000001010001000 e799b2e383abec8aaae6baa2efbd87eb8e81efa78eeab397eb95a1e5adbceabe8aeb9487eaba8de799b2ec9296ec8a9ce6b3a3e38287ed8288
UHC 癲ル슪溢g뎁硫곗땡孼꾊딇꺍癲쒖슜泣ょ킈 1110111110100110101010111110101110011010101100111110110011101110101000111110011110110101101010101110101110101001101100001110110010110110101011111110010111101101100001001101000110001010111011011000001110110011111011111010011010011100111011001001101010101001111010111110100010101010111001111011010010010100 efa6abeb9ab3eceea3e7b5aaeba9b0ecb6afe5ed84d18aed83b3efa69cec9aa9ebe8aae7b494

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)