To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???意??節??押る?異??循?????弛 0011111100111111001111111000100011010011001111110011111110010000110111110011111100111111100010011001111110000010111010010011111110001000110110010011111100111111100011110111101000111111001111110011111100111111001111111001001001101111 3f3f3f88d33f3f90df3f3f899f82e93f88d93f3f8f7a3f3f3f3f3f926f
EUC-JP ???意??節??押る?異??循?????弛 0011111100111111001111111011000011010101001111110011111111000000111000010011111100111111101100101010000110100100111010110011111110110000110110110011111100111111101111011101101100111111001111110011111100111111001111111100001111010000 3f3f3fb0d53f3fc0e13f3fb2a1a4eb3fb0db3f3fbddb3f3f3f3f3fc3d0
UTF-8 捻뀀슢意쎿벚節뉖쇀押る굞異밧춢循뗫룂料곗슱弛 111011111010011010100100111010111000000010000000111011001000101010100010111001101000010010001111111011001000111010111111111010111011001010011010111001111010111110000000111010111000100110010110111011001000011110000000111001101000101010111100111000111000001010001011111010101011010110011110111001111001010110110000111010111011000010100111111011001011011010100010111001011011111010101010111010111001011110101011111010111010001110000010111011111010011010111110111010101011001110010111111011001000101010110001111001011011110010011011 efa6a4eb8080ec8aa2e6848fec8ebfebb29ae7af80eb8996ec8780e68abce3828beab59ee795b0ebb0a7ecb6a2e5beaaeb97abeba382efa6beeab397ec8ab1e5bc9b
UHC 捻뀀슢意쎿벚節뉖쇀押る굞異밧춢循뗫룂料곗슱弛 1110011011110111101100101110101110011010101011101110101111110010100110111110011010111010101000101110111110111101100001111110101110011001101101001110010011100011101010101110101110000010100001101110110010110110101110011110010110101101100000111110001011100000100010111110101110001111100000111110100011110111101100001110110010011010101110001110110010101100 e6f7b2eb9aaeebf29be6baa2efbd87eb99b4e4e3aaeb8286ecb6b9e5ad83e2e08beb8f83e8f7b0ec9ab8ecac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)