What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 永??永??占θ?Lh永??永??占θ?L 1000100101101001001111110011111110001001011010010011111100111111100100001110100010000011110001100011111101001100011010001000100101101001001111110011111110001001011010010011111100111111100100001110100010000011110001100011111101001100 89693f3f89693f3f90e883c63f4c6889693f3f89693f3f90e883c63f4c
EUC-JP 永??永??占θ?Lh永??永??占θ?L 1011000111001010001111110011111110110001110010100011111100111111110000001110101010100110110010000011111101001100011010001011000111001010001111110011111110110001110010100011111100111111110000001110101010100110110010000011111101001100 b1ca3f3fb1ca3f3fc0eaa6c83f4c68b1ca3f3fb1ca3f3fc0eaa6c83f4c
UTF-8 永롥콉永귡릶占θ뮁Lh永롥콉永귡릶占θ뮁L 11100110101100001011100011101011101000011010010111101100101111011000100111100110101100001011100011101010101101111010000111101011101001101011011011100101100011011010000011001110101110001110101110101110100000010100110001101000111001101011000010111000111010111010000110100101111011001011110110001001111001101011000010111000111010101011011110100001111010111010011010110110111001011000110110100000110011101011100011101011101011101000000101001100 e6b0b8eba1a5ecbd89e6b0b8eab7a1eba6b6e58da0ceb8ebae814c68e6b0b8eba1a5ecbd89e6b0b8eab7a1eba6b6e58da0ceb8ebae814c
UHC 永롥콉永귡릶占θ뮁Lh永롥콉永귡릶占θ뮁L 111001111011010110001110111001011011000110000101111001111011010110000010111010011001000010010100111011111011111110100101111010001001001010010000010011000110100011100111101101011000111011100101101100011000010111100111101101011000001011101001100100001001010011101111101111111010010111101000100100101001000001001100 e7b58ee5b185e7b582e99094efbfa5e892904c68e7b58ee5b185e7b582e99094efbfa5e892904c

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
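
The conversion shown in the table can be approximated offline with a minimal Python sketch. It is not this page's actual implementation. The mapping from the table's charset labels to Python codec names (for example, `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to be what the table shows for ISO-8859-1.

```python
# Charset labels follow the table above; the Python codec names on the right
# are assumptions about which codec corresponds to each label.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, binary bit string, hexadecimal string.
    for label, (binary, hexadecimal) in convert("永Lh").items():
        print(label, binary, hexadecimal)
```

For the ASCII characters `Lh`, every charset above yields the same bytes (`4c68`), while a character such as `永` produces a different byte sequence in each charset that supports it.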