To which bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??游ヨ?攸??押??循??酉??癲 111000011001111100111111001111111001111111100000100000111000100000111111100111011011111100111111001111111000100110011111001111110011111110001111011110100011111100111111100100111101000100111111001111111110000110011111 e19f3f3f9fe083883f9dbf3f3f899f3f3f8f7a3f3f93d13f3fe19f
EUC-JP 癲??游ヨ?攸??押??循??酉??癲 111000101010000100111111001111111101111011100010101001011110100000111111110110101100000100111111001111111011001010100001001111110011111110111101110110110011111100111111110001101101001100111111001111111110001010100001 e2a13f3fdee2a5e83fdac13f3fb2a13f3fbddb3f3fc6d33f3fe2a1
UTF-8 癲욍깿游ヨ쯁攸됱돟押꾧퇍循뷴죰酉몃쑅癲 111001111001100110110010111011001001101010001101111010101011100110111111111001101011100010111000111000111000001110101000111011001010111110000001111001101001010010111000111010111001000010110001111010111000111110011111111001101000101010111100111010101011111010100111111011011000011110001101111001011011111010101010111010111011011110110100111011001010001110110000111010011000010110001001111010111010101010000011111011001001000110000101111001111001100110110010 e799b2ec9a8deab9bfe6b8b8e383a8ecaf81e694b8eb90b1eb8f9fe68abceabea7ed878de5beaaebb7b4eca3b0e98589ebaa83ec9185e799b2
UHC 癲욍깿游ヨ쯁攸됱돟押꾧퇍循뷴죰酉몃쑅癲 1110111110100110101111111110001110000011101010001110101011111101101010111110100010101000100111011110101011110010100010011110110010001001101001011110010011100011100001001110101010110111100111101110001011100000101110101110010110100001100010111110101110110111101110001110101110011100101001011110111110100110 efa6bfe383a8eafdabe8a89deaf289ec89a5e4e384eab79ee2e0bae5a18bebb7b8eb9ca5efa6

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
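
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the implementation behind this page: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` visible in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These are the closest standard-library codecs, not necessarily the exact
# tables used by the original tool.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hex bit string.
    for label, (bits, hex_string) in encode_table("A").items():
        print(label, bits, hex_string)
```

For a plain ASCII input such as `A`, every charset listed here yields the same single byte, `01000001` (hex `41`); the differences only appear for non-ASCII input such as Japanese or Korean characters.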