What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲????㎝宋??巍ル?異??矜維??惟?? 11100001100111110011111100111111001111110011111110000111011100001001000101110110001111110011111110011011110110011000001110001011001111111000100011011001001111110011111111100001111000001000100011011011001111110011111110001000110100100011111100111111 e19f3f3f3f3f877091763f3f9bd9838b3f88d93f3fe1e088db3f3f88d23f3f
EUC-JP 癲??堉??宋??巍ル?異??矜維??惟?? 1110001010100001001111110011111110001111101101111111110100111111001111111100000111010111001111110011111111010110110110111010010111101011001111111011000011011011001111110011111111100010111000101011000011011101001111110011111110110000110101000011111100111111 e2a13f3f8fb7fd3f3fc1d73f3fd6dba5eb3fb0db3f3fe2e2b0dd3f3fb0d43f3f
UTF-8 癲삳끃堉먲㎝宋먮뙕巍ル쵐異삣슫矜維뺡레惟깆떻 111001111001100110110010111011001000001010110011111010111000000110000011111001011010000010001001111010111010100010110010111000111000111010011101111001011010111010001011111010111010100010101110111010111001100110010101111001011011011110001101111000111000001110101011111011001011010110010000111001111001010110110000111011001000001010100011111011001000101010101011111001111001111110011100111001111011011010101101111010111011101010100001111010111010000010001000111001101000001110011111111010101011100110000110111010111001011010111011 e799b2ec82b3eb8183e5a089eba8b2e38e9de5ae8beba8aeeb9995e5b78de383abecb590e795b0ec82a3ec8aabe79f9ce7b6adebbaa1eba088e6839feab986eb96bb
UHC 癲삳끃堉먲㎝宋먮뙕巍ル쵐異삣슫矜維뺡레惟깆떻 1110111110100110101110111110101110000101101110011110101110111100100100001110111110100111101011111110000111100100100100001110101110001100100110101110100011100100101010111110101110101100100100101110110010110110101110111110010110011010101101001101000011101000111010111010101110010101111010011011011110111001111010101110111010110001111011001011011010111011 efa6bbeb85b9ebbc90efa7afe1e490eb8c9ae8e4abebac92ecb6bbe59ab4d0e8ebab95e9b7b9eaeeb1ecb6bb

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
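
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not stated. The codec names in `CODECS` (for example `cp932` for SJIS-WIN and `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the ISO-8859-1 row above.

```python
# Map the table's charset labels to Python codec names.
# Assumption: the page does not say which implementation it uses,
# so these are the closest standard-library codecs.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary, hexadecimal) bit strings for text in one codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("癲").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.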