To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??意??怨??押る?愉??乙??額 111000011001111100111111001111111000100011010011001111110011111110001001100001010011111100111111100010011001111110000010111010010011111110010110111110010011111100111111100010011011001100111111001111111000101001111010 e19f3f3f88d33f3f89853f3f899f82e93f96f93f3f89b33f3f8a7a
EUC-JP 癲??意??怨??押る?愉??乙??額 111000101010000100111111001111111011000011010101001111110011111110110001111001010011111100111111101100101010000110100100111010110011111111001100111110110011111100111111101100101011010100111111001111111011001111011011 e2a13f3fb0d53f3fb1e53f3fb2a1a4eb3fccfb3f3fb2b53f3fb3db
UTF-8 癲됱떜意쎿끽怨삘뵶押る굞愉녔슅乙좊걖額 111001111001100110110010111010111001000010110001111010111001011010011100111001101000010010001111111011001000111010111111111010111000000110111101111001101000000010101000111011001000001010011000111010111011010110110110111001101000101010111100111000111000001010001011111010101011010110011110111001101000010010001001111010111000010110010100111011001000101010000101111001001011100110011001111011001010001010001010111010101011000110010110111010011010000110001101 e799b2eb90b1eb969ce6848fec8ebfeb81bde680a8ec8298ebb5b6e68abce3828beab59ee68489eb8594ec8a85e4b999eca28aeab196e9a18d
UHC 癲됱떜意쎿끽怨삘뵶押る굞愉녔슅乙좊걖額 1110111110100110100010011110110010001011101100101110101111110010100110111110011010110011101000111110101010110011101110111110001010010100101101001110010011100011101010101110101110000010100001101110101011110000101100111110011010011010100101111110101111100000101000001110101110000001100000011110010011111110 efa689ec8bb2ebf29be6b3a3eab3bbe294b4e4e3aaeb8286eaf0b3e69a97ebe0a0eb8181e4fe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)