To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???吟????????陰??哀?????鴉 0011111100111111001111111000101111100001001111110011111100111111001111110011111100111111001111110011111110001001010000010011111100111111100010001010001100111111001111110011111100111111001111111110100111101011 3f3f3f8be13f3f3f3f3f3f3f3f89413f3f88a33f3f3f3f3fe9eb
EUC-JP ???吟????????陰??哀?????鴉 0011111100111111001111111011011011100011001111110011111100111111001111110011111100111111001111110011111110110001101000100011111100111111101100001010010100111111001111110011111100111111001111111111001011101101 3f3f3fb6e33f3f3f3f3f3f3f3fb1a23f3fb0a53f3f3f3f3ff2ed
UTF-8 溜깅젡吟룸젿溜쀬꺃溜깅젡陰롮뵮哀얜졋溜깅젡鴉 111011111010011110001011111010101011100110000101111011001010000010100001111001011001000010011111111010111010001110111000111011001010000010111111111011111010011110001011111011001000000010101100111010101011101010000011111011111010011110001011111010101011100110000101111011001010000010100001111010011001100110110000111010111010000110101110111010111011010110101110111001011001001110000000111011001001011010011100111011001010000110001011111011111010011110001011111010101011100110000101111011001010000010100001111010011011010010001001 efa78beab985eca0a1e5909feba3b8eca0bfefa78bec80aceaba83efa78beab985eca0a1e999b0eba1aeebb5aee59380ec969ceca18befa78beab985eca0a1e9b489
UHC 溜깅젡吟룸젿溜쀬꺃溜깅젡陰롮뵮哀얜졋溜깅젡鴉 1110101011111110101100011110101110100000100110101110101111100001101101111110101110100000101100011110101011111110100101111110110010000011101011001110101011111110101100011110101110100000100110101110101111100100100011101110110010010100101011001110010011101110101111101110101110100000101110101110101011111110101100011110101110100000100110101110010010111100 eafeb1eba09aebe1b7eba0b1eafe97ec83aceafeb1eba09aebe48eec94ace4eebeeba0baeafeb1eba09ae4bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)