To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 癲れ???┴矣??[癲れ???┴矣??[^ 1110000110011111100000101110101000111111001111110011111110000100101010001110000111100001001111110011111101011011111000011001111110000010111010100011111100111111001111111000010010101000111000011110000100111111001111110101101101011110 e19f82ea3f3f3f84a8e1e13f3f5be19f82ea3f3f3f84a8e1e13f3f5b5e
EUC-JP 癲れ???┴矣??[癲れ???┴矣??[^ 1110001010100001101001001110110000111111001111110011111110101000101010101110001011100011001111110011111101011011111000101010000110100100111011000011111100111111001111111010100010101010111000101110001100111111001111110101101101011110 e2a1a4ec3f3f3fa8aae2e33f3f5be2a1a4ec3f3f3fa8aae2e33f3f5b5e
UTF-8 癲れ쉯杻삼┴矣뉖뼸[癲れ쉯杻삼┴矣뉖뼸[^ 111001111001100110110010111000111000001010001100111011001000100110101111111011111010011110001000111011001000001010111100111000101001010010110100111001111001111110100011111010111000100110010110111010111011110010111000010110111110011110011001101100101110001110000010100011001110110010001001101011111110111110100111100010001110110010000010101111001110001010010100101101001110011110011111101000111110101110001001100101101110101110111100101110000101101101011110 e799b2e3828cec89afefa788ec82bce294b4e79fa3eb8996ebbcb85be799b2e3828cec89afefa788ec82bce294b4e79fa3eb8996ebbcb85b5e
UHC 癲れ쉯杻삼┴矣뉖뼸[癲れ쉯杻삼┴矣뉖뼸[^ 111011111010011010101010111011001001101010000111111010101111010010111011111011111010011010101010111010111111100010000111111010111001011010111011010110111110111110100110101010101110110010011010100001111110101011110100101110111110111110100110101010101110101111111000100001111110101110010110101110110101101101011110 efa6aaec9a87eaf4bbefa6aaebf887eb96bb5befa6aaec9a87eaf4bbefa6aaebf887eb96bb5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)