Which bit string is a character (or string of characters) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8倚??議??汚??日?─????λ? 1110000110011111001111111000001001010111100110001101111100111111001111111000101101100011001111110011111110001001100110000011111100111111100100111111101000111111100001001001111100111111001111110011111100111111100000111100100100111111 e19f3f825798df3f3f8b633f3f89983f3f93fa3f849f3f3f3f3f83c93f
EUC-JP 癲?8倚??議??汚??日?─????λ? 1110001010100001001111111010001110111000110100001110000100111111001111111011010111000100001111110011111110110001111110000011111100111111110001101111110000111111101010001010000100111111001111110011111100111111101001101100101100111111 e2a13fa3b8d0e13f3fb5c43f3fb1f83f3fc6fc3fa8a13f3f3f3fa6cb3f
UTF-8 癲쒕8倚쒏쾮議얩뫛汚살닂日뗰─紐꾨퉪若λ븩 1110011110011001101100101110110010010010100101011110111110111100100110001110010110000000100110101110110010010010100011111110110010111110101011101110100010101101101100001110110010010110101010011110101110101011100110111110011010110001100110101110110010000010101101001110101110001011100000101110011010010111101001011110101110010111101100001110001010010100100000001110111110100111100011111110101010111110101010001110110110001001101010101110111110100101101101001100111010111011111010111011100010101001 e799b2ec9295efbc98e5809aec928fecbeaee8adb0ec96a9ebab9be6b19aec82b4eb8b82e697a5eb97b0e29480efa78feabea8ed89aaefa5b4cebbebb8a9
UHC 癲쒕8倚쒏쾮議얩뫛汚살닂日뗰─紐꾨퉪若λ븩 111011111010011010011100111010111010001110111000111010111110111110011100111001101011001010000101111011001010000110111110111011011001000110111011111001111111110110111011111011001000100010001011111011001110110110001011111011111010011010100001111010111010101010000100111010111011100110000010111001011010111010100101111010111001010110010010 efa69ceba3b8ebef9ce6b285eca1beed91bbe7fdbbec888beced8befa6a1ebaa84ebb982e5aea5eb9592

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
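
For readers who want to reproduce this kind of table offline, the following is a minimal Python sketch, not the page's actual implementation. The names CHARSETS, encode_row and convert are hypothetical. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption, as is the choice to replace unencodable characters with "?" (0x3f), which is consistent with the 3f bytes in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in convert("日").items():
        print(name, bits, hexstr)
```

Passing errors="replace" keeps the conversion from raising an exception on characters a charset cannot represent, at the cost of losing them in the output.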