To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??爰??衰⑦?汚??壹?┃怨λ?裔?? 11100001100111110011111100111111111000001010011100111111001111111001000010001010100001110100011000111111100010011001100000111111001111111001101011100011001111111000010010101011100010011000010110000011110010010011111111100101111000010011111100111111 e19f3f3fe0a73f3f908a87463f89983f3f9ae33f84ab898583c93fe5e13f3f
EUC-JP 癲??爰??衰??汚??壹?┃怨λ?裔?? 111000101010000100111111001111111110000010101001001111110011111110111111111010100011111100111111101100011111100000111111001111111101010011100101001111111010100010101101101100011110010110100110110010110011111111101010111000110011111100111111 e2a13f3fe0a93f3fbfea3f3fb1f83f3fd4e53fa8adb1e5a6cb3feae33f3f
UTF-8 癲쒕짅爰녵튃衰⑦뫛汚살닂壹삯┃怨λ젣裔됰뜥 1110011110011001101100101110110010010010100101011110110010100111100001011110011110001000101100001110101110000101101101011110110110001010100000111110100010100001101100001110001010010001101001101110101110101011100110111110011010110001100110101110110010000010101101001110101110001011100000101110010110100011101110011110110010000010101011111110001010010100100000111110011010000000101010001100111010111011111011001010000010100011111010001010001110010100111010111001000010110000111010111001110010100101 e799b2ec9295eca785e788b0eb85b5ed8a83e8a1b0e291a6ebab9be6b19aec82b4eb8b82e5a3b9ec82afe29483e680a8cebbeca0a3e8a394eb90b0eb9ca5
UHC 癲쒕짅爰녵튃衰⑦뫛汚살닂壹삯┃怨λ젣裔됰뜥 111011111010011010011100111010111010001110010100111010101011101010000110111001001011100110011001111000011111000110101000111011011001000110111011111001111111110110111011111011001000100010001011111011001110110010111011111010011010011010101101111010101011001110100101111010111010000010011100111001111110000010001001111010111000110110101000 efa69ceba394eaba86e4b999e1f1a8ed91bbe7fdbbec888bececbbe9a6adeab3a5eba09ce7e089eb8da8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)