To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 永??孺??寃??苑??揄??腋???揄?? 1000100101101001001111110011111110011011011111010011111100111111100110111000001100111111001111111000100110010001001111110011111110011101100010010011111100111111111000111111110000111111001111110011111110011101100010010011111100111111 89693f3f9b7d3f3f9b833f3f89913f3f9d893f3fe3fc3f3f3f9d893f3f
EUC-JP 永??孺??寃??苑??揄??腋???揄?? 1011000111001010001111110011111111010101110111100011111100111111110101011110001100111111001111111011000111110001001111110011111111011001111010010011111100111111111001101111111000111111001111110011111111011001111010010011111100111111 b1ca3f3fd5de3f3fd5e33f3fb1f13f3fd9e93f3fe6fe3f3f3fd9e93f3f
UTF-8 永띔벰孺대뱪寃뉒쨼苑앸뒾揄먭쾱腋읦욌뒾揄먭쾱 111001101011000010111000111010111001110110010100111010111011001010110000111001011010110110111010111010111000110010000000111010111011000110101010111001011010111110000011111010111000100110010010111011001010100010111100111010001000101110010001111011001001010110111000111010111001001010111110111001101000111110000100111010111010100010101101111011001011111010110001111010001000010110001011111011001001110110100110111011001001101010001100111010111001001010111110111001101000111110000100111010111010100010101101111011001011111010110001 e6b0b8eb9d94ebb2b0e5adbaeb8c80ebb1aae5af83eb8992eca8bce88b91ec95b8eb92bee68f84eba8adecbeb1e8858bec9da6ec9a8ceb92bee68f84eba8adecbeb1
UHC 永띔벰孺대뱪寃뉒쨼苑앸뒾揄먭쾱腋읦욌뒾揄먭쾱 1110011110110101101101101110101010111010101010001110101011101000101101001110101110010011100100001110101010110010100001111110011110100100100101101110101010111101100111011110101110001010101101001110101011110001100100001110101010110010100001111110010011111101100111111100111010011110111010111000101010110100111010101111000110010000111010101011001010000111 e7b5b6eabaa8eae8b4eb9390eab287e7a496eabd9deb8ab4eaf190eab287e4fd9fce9eeb8ab4eaf190eab287

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
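
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation. It assumes that Python's built-in codecs `latin-1`, `cp932`, `euc_jp`, `utf-8` and `cp949` correspond to the charsets named above, and that a character with no mapping in the target charset is replaced by "?" (byte 3f), as the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_row` and `convert` are hypothetical.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexa) in convert("永").items():
        print(name, binary, hexa)
```

For the single character 永, this sketch should give e6b0b8 in UTF-8, 8969 in SJIS-WIN and b1ca in EUC-JP, matching the leading bytes of the corresponding rows above, and 3f in ISO-8859-1, which cannot represent the character.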