To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???淫ъ?陰??淫?????吟ъ?吟ъ?B 001111110011111100111111100010001111101010000100100011000011111110001001010000010011111100111111100010001111101000111111001111110011111100111111001111111000101111100001100001001000110000111111100010111110000110000100100011000011111101000010 3f3f3f88fa848c3f89413f3f88fa3f3f3f3f3f8be1848c3f8be1848c3f42
EUC-JP ???淫ъ?陰??淫?????吟ъ?吟ъ?B 001111110011111100111111101100001111110010100111111011000011111110110001101000100011111100111111101100001111110000111111001111110011111100111111001111111011011011100011101001111110110000111111101101101110001110100111111011000011111101000010 3f3f3fb0fca7ec3fb1a23f3fb0fc3f3f3f3f3fb6e3a7ec3fb6e3a7ec3f42
UTF-8 溜깅젡淫ъ꺃陰곗꺃淫좊젿溜싳꽟吟ъ넱吟ъ꽑B 11101111101001111000101111101010101110011000010111101100101000001010000111100110101101111010101111010001100010101110101010111010100000111110100110011001101100001110101010110011100101111110101010111010100000111110011010110111101010111110110010100010100010101110110010100000101111111110111110100111100010111110110010001011101100111110101010111101100111111110010110010000100111111101000110001010111010111000010010110001111001011001000010011111110100011000101011101010101111011001000101000010 efa78beab985eca0a1e6b7abd18aeaba83e999b0eab397eaba83e6b7abeca28aeca0bfefa78bec8bb3eabd9fe5909fd18aeb84b1e5909fd18aeabd9142
UHC 溜깅젡淫ъ꺃陰곗꺃淫좊젿溜싳꽟吟ъ넱吟ъ꽑B 11101010111111101011000111101011101000001001101011101011111000101010110011101100100000111010110011101011111001001011000011101100100000111010110011101011111000101010000011101011101000001011000111101010111111101001101011101100100001001010110011101011111000011010110011101100100001101011000011101011111000011010110011101100100001001010000001000010 eafeb1eba09aebe2acec83acebe4b0ec83acebe2a0eba0b1eafe9aec84acebe1acec86b0ebe1acec84a042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)