To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???淫??蔭??淫?????吟??吟ъ?B 00111111001111110011111110001000111110100011111100111111100010001111110000111111001111111000100011111010001111110011111100111111001111110011111110001011111000010011111100111111100010111110000110000100100011000011111101000010 3f3f3f88fa3f3f88fc3f3f88fa3f3f3f3f3f8be13f3f8be1848c3f42
EUC-JP ???淫??蔭??淫?????吟??吟ъ?B 00111111001111110011111110110000111111000011111100111111101100001111111000111111001111111011000011111100001111110011111100111111001111110011111110110110111000110011111100111111101101101110001110100111111011000011111101000010 3f3f3fb0fc3f3fb0fe3f3fb0fc3f3f3f3f3fb6e33f3fb6e3a7ec3f42
UTF-8 溜깅젡淫㏃꺎蔭곗꺃淫좊젿溜싳꽘吟댁뀺吟ъ넱B 111011111010011110001011111010101011100110000101111011001010000010100001111001101011011110101011111000111000111110000011111010101011101010001110111010001001010010101101111010101011001110010111111010101011101010000011111001101011011110101011111011001010001010001010111011001010000010111111111011111010011110001011111011001000101110110011111010101011110110011000111001011001000010011111111010111000110010000001111010111000000010111010111001011001000010011111110100011000101011101011100001001011000101000010 efa78beab985eca0a1e6b7abe38f83eaba8ee894adeab397eaba83e6b7abeca28aeca0bfefa78bec8bb3eabd98e5909feb8c81eb80bae5909fd18aeb84b142
UHC 溜깅젡淫㏃꺎蔭곗꺃淫좊젿溜싳꽘吟댁뀺吟ъ넱B 11101010111111101011000111101011101000001001101011101011111000101010011111101100100000111011010011101011111000111011000011101100100000111010110011101011111000101010000011101011101000001011000111101010111111101001101011101100100001001010011111101011111000011011010011101100100001011011000011101011111000011010110011101100100001101011000001000010 eafeb1eba09aebe2a7ec83b4ebe3b0ec83acebe2a0eba0b1eafe9aec84a7ebe1b4ec85b0ebe1acec86b042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)