To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?苑??怨??筌??游??怨κ?億?? 111000011001111110000011100010110011111110001001100100010011111100111111100010011000010100111111001111111110001010100011001111110011111110011111111000000011111100111111100010011000010110000011110010000011111110001001101011010011111100111111 e19f838b3f89913f3f89853f3fe2a33f3f9fe03f3f898583c83f89ad3f3f
EUC-JP 癲ル?苑??怨??筌??游??怨κ?億?? 111000101010000110100101111010110011111110110001111100010011111100111111101100011110010100111111001111111110010010100101001111110011111111011110111000100011111100111111101100011110010110100110110010100011111110110010101011110011111100111111 e2a1a5eb3fb1f13f3fb1e53f3fe4a53f3fdee23f3fb1e5a6ca3fb2af3f3f
UTF-8 癲ル슓苑볡땟怨⑹쾸筌뤿슔游루넭怨κ덱億됰몘 1110011110011001101100101110001110000011101010111110110010001010100100111110100010001011100100011110101110110011101000011110101110010101100111111110011010000000101010001110001010010001101110011110110010111110101110001110011110101101100011001110101110100100101111111110110010001010100101001110011010111000101110001110101110100011101010001110101110000100101011011110011010000000101010001100111010111010111010111000110110110001111001011000010010000100111010111001000010110000111010111010101010011000 e799b2e383abec8a93e88b91ebb3a1eb959fe680a8e291b9ecbeb8e7ad8ceba4bfec8a94e6b8b8eba3a8eb84ade680a8cebaeb8db1e58484eb90b0ebaa98
UHC 癲ル슓苑볡땟怨⑹쾸筌뤿슔游루넭怨κ덱億됰몘 111011111010011010101011111010111001101010100010111010101011110110010011111001111011011010101101111010101011001110101001111011001011001010001110111011111010011110001111111010111001101010100011111010101111110110110111111001111000011010101100111010101011001110100101111010101011010110100110111001011110001010001001111010111001000110000110 efa6abeb9aa2eabd93e7b6adeab3a9ecb28eefa78feb9aa3eafdb7e786aceab3a5eab5a6e5e289eb9186

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)