To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???g?????????g??????? 001111110011111100111111011001110011111100111111001111110011111100111111001111110011111100111111001111110110011100111111001111110011111100111111001111110011111100111111 3f3f3f673f3f3f3f3f3f3f3f3f673f3f3f3f3f3f3f
SJIS-WIN ???g???孺??孺??g???孺??乳 00111111001111110011111101100111001111110011111100111111100110110111110100111111001111111001101101111101001111110011111101100111001111110011111100111111100110110111110100111111001111111001001111111011 3f3f3f673f3f3f9b7d3f3f9b7d3f3f673f3f3f9b7d3f3f93fb
EUC-JP ???g???孺??孺??g???孺??乳 00111111001111110011111101100111001111110011111100111111110101011101111000111111001111111101010111011110001111110011111101100111001111110011111100111111110101011101111000111111001111111100011011111101 3f3f3f673f3f3fd5de3f3fd5de3f3f673f3f3fd5de3f3fc6fd
UTF-8 溜삳젘g溜뷸솹孺숅갱孺싲젘g溜뷸솹孺숅갱乳 1110111110100111100010111110110010000010101100111110110010100000100110000110011111101111101001111000101111101011101101111011100011101100100001101011100111100101101011011011101011101100100010001000010111101010101100001011000111100101101011011011101011101100100010111011001011101100101000001001100001100111111011111010011110001011111010111011011110111000111011001000011010111001111001011010110110111010111011001000100010000101111010101011000010110001111001001011100110110011 efa78bec82b3eca09867efa78bebb7b8ec86b9e5adbaec8885eab0b1e5adbaec8bb2eca09867efa78bebb7b8ec86b9e5adbaec8885eab0b1e4b9b3
UHC 溜삳젘g溜뷸솹孺숅갱孺싲젘g溜뷸솹孺숅갱乳 11101010111111101011101111101011101000001001010001100111111010101111111010111010111001101001100110101110111010101110100010011001111010011011000010111011111010101110100010011010111010111010000010010100011001111110101011111110101110101110011010011001101011101110101011101000100110011110100110110000101110111110101011100001 eafebbeba09467eafebae699aeeae899e9b0bbeae89aeba09467eafebae699aeeae899e9b0bbeae1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)