To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???陰??淫??蔭??濡??蔭??蔭??B 00111111001111110011111110001001010000010011111100111111100010001111101000111111001111111000100011111100001111110011111110010100010001110011111100111111100010001111110000111111001111111000100011111100001111110011111101000010 3f3f3f89413f3f88fa3f3f88fc3f3f94473f3f88fc3f3f88fc3f3f42
EUC-JP ???陰??淫??蔭??濡??蔭??蔭??B 00111111001111110011111110110001101000100011111100111111101100001111110000111111001111111011000011111110001111110011111111000111101010000011111100111111101100001111111000111111001111111011000011111110001111110011111101000010 3f3f3fb1a23f3fb0fc3f3fb0fe3f3fc7a83f3fb0fe3f3fb0fe3f3f42
UTF-8 溜깅죱陰잛뀥淫앹뀱蔭⑸젿濡뚯넖蔭쒖꽍蔭㏃뀳B 11101111101001111000101111101010101110011000010111101100101000111011000111101001100110011011000011101100100111101001101111101011100000001010010111100110101101111010101111101100100101011011100111101011100000001011000111101000100101001010110111100010100100011011100011101100101000001011111111100110101111111010000111101011100110101010111111101011100001001001011011101000100101001010110111101100100100101001011011101010101111011000110111101000100101001010110111100011100011111000001111101011100000001011001101000010 efa78beab985eca3b1e999b0ec9e9beb80a5e6b7abec95b9eb80b1e894ade291b8eca0bfe6bfa1eb9aafeb8496e894adec9296eabd8de894ade38f83eb80b342
UHC 溜깅죱陰잛뀥淫앹뀱蔭⑸젿濡뚯넖蔭쒖꽍蔭㏃뀳B 11101010111111101011000111101011101000011000110011101011111001001001111111101100100001011001110011101011111000101001110111101100100001011010011111101011111000111010100111101011101000001011000111101011101000011000110011101100100001101001111111101011111000111001110011101100100001001001110111101011111000111010011111101100100001011010100101000010 eafeb1eba18cebe49fec859cebe29dec85a7ebe3a9eba0b1eba18cec869febe39cec849debe3a7ec85a942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)