What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 暗????ぜ酉??域??誼∽?袁??沃 100010001100001100111111001111110011111100111111100000101011101010010011110100010011111100111111100010001110011000111111001111111000101101100010100000011110010000111111111001011100110100111111001111111001011110000000 88c33f3f3f3f82ba93d13f3f88e63f3f8b6281e43fe5cd3f3f9780
EUC-JP 暗????ぜ酉??域??誼∽?袁??沃 101100001100010100111111001111110011111100111111101001001011110011000110110100110011111100111111101100001110100000111111001111111011010111000011101000101110011000111111111010101100111100111111001111111100110111100000 b0c53f3f3f3fa4bcc6d33f3fb0e83f3fb5c3a2e63feacf3f3fcde0
UTF-8 暗삳쉴理롨ぜ酉몃뮅域㏓벡誼∽쬆袁㏃뫊沃 111001101001101010010111111011001000001010110011111011001000100110110100111011111010011110100100111010111010000110101000111000111000000110011100111010011000010110001001111010111010101010000011111010111010111010000101111001011001111110011111111000111000111110010011111010111011001010100001111010001010101010111100111000101000100010111101111011001010110010000110111010001010001010000001111000111000111110000011111010111010101110001010111001101011001010000011 e69a97ec82b3ec89b4efa7a4eba1a8e3819ce98589ebaa83ebae85e59f9fe38f93ebb2a1e8aabce288bdecac86e8a281e38f83ebab8ae6b283
UHC 暗삳쉴理롨ぜ酉몃뮅域㏓벡誼∽쬆袁㏃뫊沃 1110010011011110101110111110101110111101101011111110110010110101100011101110100010101010101111001110101110110111101110001110101110010010100101001110011010110100101001111110101110111010101001001110101111111110101000011110111110100110100111011110101010111110101001111110110010010001101011001110100010101010 e4debbebbdafecb58ee8aabcebb7b8eb9294e6b4a7ebbaa4ebfea1efa69deabea7ec91ace8aa

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent are shown as "?" (0x3f) in the table above.
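
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name encode_report is hypothetical, and the codec names are assumptions: Python's "cp932" is used as a stand-in for SJIS-Win and "cp949" as a stand-in for UHC, and their mappings may differ from this tool's for some characters. Unencodable characters are replaced with "?" (0x3f), which matches the behaviour visible in the ISO-8859-1 row.

```python
# Assumed Python codec names for the charsets listed in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: closest built-in codec to SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: closest built-in codec to UHC
}


def encode_report(text):
    """Return {charset: (binary string, hex string)} for the given text."""
    report = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        report[label] = (bits, data.hex())
    return report


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_report("暗").items():
        print(label, bits, hexstr)
```

For the single character 暗, this sketch yields 3f for ISO-8859-1, 88c3 for SJIS-WIN, b0c5 for EUC-JP and e69a97 for UTF-8, in agreement with the leading bytes of the corresponding table rows.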